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PREFACE 


T hough this volume begins the study of modern differential geometry in 
earnest, it ends just when we have gotten to the fanciest definition of a 
connection, but have hardly begun to start spouting theorems. A glance at the 
Table of Contents will show that the semi-historical path promised in Volume I 
really has been followed. The most decisive encounters with classical differential 
geometry occur in Chapters 3 and 4, which present the classical papers and then 
explain them. While it is possible to get through this volume without reading 
any of the classical works themselves, the easy way out certainly misses all the 
fun! 

There are no Problem sets in this volume，which is a shame, but much of 
the material doesn’t lend itself to problems，and even if it did, I would have 
gone berserk trying to produce them in any reasonable length of time. As com¬ 
pensation for the lack of Problems, the final volume contains a comprehensive 
bibliography of the literature of Differential Geometry, including texts where 
problems may be found. 
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CHAPTER 1 


CURVES IN THE PLANE 
AND IN SPACE 


V olume I of these notes represents the “differential” part of differential 
geometry In this volume we finally get down to some geometry. For 
the present we are going to study only the simplest geometric objects, curves, 
and at first our approach will be terribly geometric. Nevertheless, the results 
of this chapter span a couple hundred years, and we will end with some very 
modern looking constructions. 

We begin by considering only curves in the plane, and we further restrict 
our attention to curves c : [a,b] M 2 which are immersions ， i.e., which satisfy 
c f (t)= : dc/dt ^ 0 for all t e [a, b]. For these curves, the arclength function 
[a, b] M, 



c\u)\ du, 


is a diffeomorphism s: [a,b] [0, L], where L = length of c. The curve 

y = cos~ l is then a reparameterization of c ； clearly y is parameterized by arc- 
length, I〆 ⑴ I = 1. We have just introduced a convention to be used throughout 
the chapter: for curves y : [a, b] M 2 with unit tangent vectors, we will usually 
denote a typical point in [a, b] by s, even at the risk of confusing it with the arc- 
length function defined above. We also emphasize that throughout this chapter 
y\s) just denotes a vector in M 3 , not a tangent vector in M 3 y ⑴， even though 
we will often draw it that way. 
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For an immersed curve c : [a,b] M 2 , we would like a way of measuring 

the amount that c is “curving” at any point. No matter how vague this，as yet 



intuitive, term may be，we will surely all agree that 

(a) a straight line is not curving at all ， 

(b) a circle of radius R > r is curving less than the circle of radius r. 

R 

. ®. 

If we have to attach a numerical measure of curvature to these particular curves, 

it seems reasonable to define the curvature of a straight line at any point to be 0, 

and to define the curvature of a circle of radius r to be 1 / r at any point. 

From these special cases we want to develop a definition that works for any 

curve. We take as a clue the procedure which we use in a similar, but simpler, 

case. We can easily define what we mean by the direction of a curve c : [a,b] 

M 2 at any time t. This direction is determined by the tangent line at c(t)^ which 

is the limit of lines through c(t) and c{t f ) as t f t. This limit exists if c f {t) 
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exists and is non-zero. If, moreover, c f is continuous at t, then this limit line can 
be described as the limit, as “，,2 — of the line through c(,i) and c(, 2 ); for 
this line is parallel to the tangent through c(§) for some § between and ^ 



In order to determine the curvature of a curve we follow an analogous pro¬ 
cedure. We find the circle which passes through three points c(,i) ， c(, 2 ) ， c(, 3 ) 
and then see if this circle approaches a limiting circle as h t. If it does, 
we can define the curvature of c at ^ to be the reciprocal of the radius of this 
circle. 

Before proving a precise theorem, we simply try to determine the position 
of this circle, assuming it does exist. This can be done as follows. First of all, 
since c is an immersion，it is locally one-one, so for distinct /i ，， 2，，3 near t, the 
points c(/i), c{t 2 ), c(^) are distinct. To be specific, let us say that t\ < t: < h. 
Suppose also that c(/i), c(^), c(^) do not lie on a straight line, so that there is a 
unique circle through these three points, with center C(,i,, 2 ， , 3 ). Consider the 
function 

t <c(,) — C(,i,,2 ， ,3), c{t) - C(t u t 2 ,h))- 

At “ ， ， 2,，3 this function has the same value, the square of the radius of the circle 
through c( ， i),c( ， 2 ),c( ， 3 ). So its derivative must be 0 at points e (t\ , ti ) and 
Hi ^ (, 2 ， , 3 ): 

(1) 0 = 〈 〆(&•)，— C(,i ， ,2,,3 )>， G + / = 1,2. 

Similarly, the function t c(r) — C(,i,, 2 ， , 3 )> must have derivative 0 at 

some point i] e 

(2) <〆’(")〆(")- C( ， i, ， 2 ，， 3)> =-<〆("),〆("”• 

Now, if the points C{t\,t 2 ,h) approach a point C as— ，， and if c n is 
continuous, then (1) and (2) clearly imply that 

(10 <〆( ， ) ， c(O-C) = 0 

(20 W )， c {t)-C)^-(c f {tU\t)). 
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The first of these equations shows that the circle through c(0 with center C 
must be tangent to c at c(,)，which is certainly to be expected. Thus C is already 



restricted to lie along a certain line. If c f \t) is not a multiple of c\t)^ the second 
equation then determines C, since it tells us the inner product of c(,) — C 
with 〆’(,)• If c ff {t) is a multiple of c\t), then we obtain the contradiction 

0 ^ — 〈 〆(,) ， 〆(,）〉= c(t) — C) = constant - 〈 〆(/)，— C) = 0. 

In other words, if 〆’(,）is a multiple of c\t) y this limiting position cannot exist. 

Although equations (1’) ，（ 2’）could be solved for C, we can make things a lot 
easier for ourselves by considering a curve c parameterized by arclength, so that 
|c’( l y)| =： 1. (This means that c f (s) always lies on the unit circle S l C M 2 , even 
though we often picture it instead as a tangent vector.) Now the equation 

<〆 ⑴， c’ ⑽ =1 


can be differentiated to give 

(*) 〈 c " ⑴， 〆 ⑽ = 0. 

In other words ， c " ⑷ is always perpendicular to 〆⑴ • In particular, 〆 ’⑷ is a 
multiple of c’Q) only when ("(s) = 0. Equations (1’）and (*) show that c’’(s) 
and c(^) — C are both perpendicular to 〆&)• If c"(s) # 0, then we can write 
c ⑴ 一 C = a . c"(s)• Substituting in (2’）gives 

a • = - 1 . 


Since we also liave 

| c * ⑷ -C\ = \a\ • I〆’ ⑴ I ， 


've easily deduce that 


|c (^) — C| = 


c" ⑷ I 


In other words, our circle is perpendicular to c at c(^), and has radius 
Its curvature, and hence the curvature of c at s, is thus |(?"(^)|. We are ready 
to reverse the order of this reasoning，and take care of details which we have 
ignored. 
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1. THEOREM. Let c : [a ， 办 ] —M be a C 2 curve parameterized by arclength. 
If c ff (s) ^ 0 , then for 51 , 52,53 sufficiently close to s, the points c(S 2 ) ， c(^ 3 ) 
do not lie on a straight line. As 51 , 52,^3 ^ s the unique circle through the 
points c(si) approaches a circle passing through c(s), whose radius is l/|c" ⑴ |, 
and whose center lies on the line through c(^) perpendicular to the tangent line 
through c ⑴ • If c"(5) = 0, then, even if the points c(^/) do not lie on a line, 
the circles through them do not approach a limiting circle. 

PROOF. We first show that if c ff {s) ^ 0 , then the points c(^i),c( 52 ),^*( 53 ) can¬ 
not lie on a line for ^ 1 , 52 , arbitrarily close to s. Whenever the points c(5/) 
lie on a line, for 5i < ^2 < ^ 3 , there are points G (^ 1 , 52 ) and §2 ^ (^ 2 , ^ 3 ) 
where the tangent lines are parallel to this line (Cauchy mean value theorem). 




This means that the curve c f in has = c’(§2). For and §2 close 

enough to 5 , the image § 2 ]) can’t be the whole of S l . So there is a point 

rj e (§ 1 ,^ 2 ) where c’（"）is furthest from c’ （ 6) in some direction along the circle. 
It follows that c ff {r]) — 0. This cannot happen for rj arbitrarily close to 5 , so the 
points c(^i), c( 52 ),^( 53 ) cannot lie on a line for S\,S 2 ,s^ arbitrarily close to s. 
Now let C be the unique point satisfying 

〈 〆 ⑴， c ⑴ -C)=0 

*) (c ff (s), c(s) -C) = -(c\s),c f (s)) = ~\. 

For 5i < ^2 < ^3 close to 5 , let C(5 i, 52,^3) be the center of the unique circle 
through the points c(si). We have already seen that 

(c'^), c(§) - C(s u s 2 ,s 3 )) = 0 I e ( 51 , 53 ) 

(c f \r]), c(r]) - C(s u s 2 ,s 3 )) = -(c\rf) ， c\r}Y) = -1 rj e ( 51 ,^ 3 ). 

Since c’ ( 专 ）^ c r (5) and c ff {r]) —> c ff {s) as 5 / ^ 5 , comparison of these equations 
with (*) shows that C(5 i, 52,53) must approach C. 

We have already shown that if c" ⑴ = 0, then C(^i, 52 , 53 ) cannot approach 
a limiting position as 5 / ^ s. ♦}► 
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The circle determined in Theorem 1 is called the osculating circle of c at ^ 
(“osculate” means to kiss). It is clearly the circle which best approximates the 



curve c at c(^). This suggests that we define |(："(^)| to be the curvature of c at 
even for c ff {s) = 0 this gives the result we would like. With this definition, cur¬ 
vature would always be non-negative, but we can modify the definition slightly 
so that we obtain a signed curvature. We will henceforth use the notation t(^) 
for c'y), the unit tangent vector of c at s. We use this notation only for curves 
parameterized by arclength; recall once again that t ⑷ g S l , even though we 
usually draw it as an element of M 3 c (_y). We then define n ⑴， the unit normal 
at s, to be the unit vector such that n(^) is perpendicular to t(,y), and [t(^), n(^)] 
is the standard orientation of R 2 . Thus n(^) = (—1 2 (^), t 1 ^)). Now we define 



the curvature a: ⑷ of c at ^ by the equation 


t’ ⑴ = 咖） . n ⑴. 


Notice that 


k(^)i = |t’Cy)| = k" ⑴ I ， 


so that I/Ia:^)! is the radius of the osculating circle at s, for Ar(^) ^ 0. The 
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significance of the sign of k is indicated in the figure below. 



For any curve c we can define a curve c “going in the opposite direction” by 
c(^) — c(— ^). If we use t(^) for c f (s), then clearly 

t(^) = — 1(— s )， hence n(,y) = —n(— s) 

This shows that the curvature k(s) of c at ^ is 

咖） = —k(—s). 

One can see this change of sign in the figure above. Traversing the left curve 
in the opposite direction, and turning the figure upside down, one obtains the 
curve on the right. 

This relation can also be seen from explicit formulas for /c, which are some¬ 
times useful to have. To write these, we abandon our usual practice of indicating 
component functions with superscripts, and instead write c(^) = (ci(^), C 2 (^)). 

We have, of course, \k(s)\ = y (^^(^)) 2 + (C2 ff (s)) 2 , but we can also develop a 
formula for k(s) itself. Since \k(s)\ is the length of c"(s)，which is perpendicular 
to the unit vector c’(s)，clearly \k(s)\ is also the area of the rectangle spanned 
by c'b) and This area is given by det(c / (5 , ), c // (5 , )) which, moreover, 



clearly has the same sign as So 


k{s) = det 


(ci f (s) 
\C2 f (s) 
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Until now we have been working exclusively with curves parameterized by 
arclength. Theoretically this is sufficient, since we consider only curves which 
can be reparameterized by arclength. In practice, however, it is usually very 
inconvenient to actually perform this reparameterization, which involves the 
inverse of a function defined as an integral. Moreover, since our formula in¬ 
volves only derivatives, only the integrand of this integral should play any crucial 
role. Consider any (immersed) curve c : [a, b] M 2 . Letting s : [a, b] [0, L] 
be arclength, and defining y = c o we have 


c = y o s 


d X 


dc 

i e — (t) 

5 dr } 


, ds 、 
y ⑽ ))5 ⑴ 


d 2 c 

~dt^ 


y f \s) 


rdsV 

\dt) 


d 2 s 


+ y f (s) 


Thus 


y f (s )= 


dc / ds 


y ,f {s) 


dt I dt 


d 2 c d 2 s dc / ds 

It 


d 2 c dc 


⑼ 2 


dt 2 dt 

(ds^ 

\di) 


say. 


Denoting da/dt by c/(0, we have for the curvature k : [a,b] M 2 of c», 


k(() = curvature of y at ^ = s(t) 

= det(y f (s),y ff (s)) 

= (ds/dt)~ 3 det(c(0, c(0 — ac(t)) 
=(ds/dt)~ 3 det(c(0, c(0)- 


Thus 


• • • • • • 

C\C 2 - C 2 C\ 

⑹ + c ' 2 2 )3 /2. 


Naturally, this formula becomes meaningless for a non-immersed curve, 
where c\(t) = r 2 (0 =0. In such cases, we should not generally expect to 
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have a meaningful notion of curvature, for there is nothing to stop the curve 
from “changing its curvature” at this point. 



Before examining the significance of the curvature function k further, we 
pause for an observation. Since the osculating circle is the limiting position 
of the circle through c^i), c^), c(^ 3 ) no matter how ^ 1 , ^ 2 , ^3 ^ it is clearly 
also the limiting position as 〆 —s of the circle which is tangent to c at 
and which passes through ( ： (〆)• To see this, we just choose ^ 1 , ^2 much closer 



to s than s f is. The center of the osculating circle can thus be described as the 
limiting position of the point S in the figure below as 〆 —These descriptions 
of curvature go back to Huygens, Leibniz, and Newton. 



We began our discussion of curvature by considering straight lines and cir¬ 
cles, which were our original models of curves which ought to have constant 
curvature. It is certainly clear that our definitions do assign curvature 0 to a 
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straight line c, for c" = 0 if c is parameterized by arclength; and absolute 
curvature |a:| = 及 to a circle of radius R, since the circle is its own osculating 
circle. A really satisfactory measure of curvature ought to assign constant cur¬ 
vature to these curves alone. It is clear that if c, parameterized by arclength ， 
has curvature 0 everywhere, so that c n = 0, then d is constant, so c is a straight 
line. The analysis of a curve c with non-zero constant curvature k (which we 
might as well assume positive) becomes frustratingly complicated if approached 
in too straightforward a way. We assume, as usual, that c is parameterized by 
arclength. Let us introduce the components of the unit tangent vector curve, 


t(>) = (a(s),P(s)). 

Then 

a f2 + p f2 =K 2 , 

so 

( 1 ) (/(/，+ p f p ff = 0 . 

Recall also that 〈 t ⑴， t’ ⑴〉 = 〈 〆⑴，(:”⑷〉 = 0, so 

( 2 ) otot f + = 0 . 


Equations (1) and (2) show that (a, jS) and (a’’ ， p ff ) are always perpendicular to 
(a’ ， j8’)，so one is a multiple of the other, 


(3) 


o: (^) = fi(s)a(s) 

P” （ s) = 


Moreover, differentiating (2) gives 


a f2 + p f2 + aot ff + pp” = Q 


or 

(4) K 2 + ⑽" + = 0. 

Substituting from (3)，and using a 2 + = 1， gives fi(s) = —k 2 . Thus, 


( 5 ) 


ot ff + K 2 a = 0 
+ k 2 P = 0. 
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The solutions of the differential equation in (5) are ^ a sin Ks~\~b cos ks, which 
can also be written s A sin(Ks + B). In order to have o; 2 + 多 2 = 1， we 
clearly need A = 1 and 

Qf(5*) = sin(Ks + B) 

P(s) = cos(ks + B). 

Thus t traverses a circle, which implies that c itself is a circle of radius \/k. 

The complicated calculations in the preceding paragraph conceal a basic 
principle which is much simpler. Suppose that we are given an arbitrary con¬ 
tinuous function k : [a ， 6] — R. We can ask how many arclength parameterized 
curves c : [a, b] M 2 there are with curvature function equal to k, without nec¬ 
essarily trying to find a specific formula for these curves. If there is one such 
curve, then there are automatically others, for translating or rotating a curve 
will not change its curvature. However，this is the only extent to which the curve 
is not determined. 

2. THEOREM. Let k : [a, b] M be continuous. Then there is a curve 
c : [a,b] M 2 , parameterized by arclength, whose curvature at s is for 

all s g [a, b]. Moreover, if c and c are two such curves, then c = Aoc where A 
is some proper Euclidean motion (a translation followed by a rotation [an ele¬ 
ment of SO(2)]). 

PROOF. Theorem 1.5-17 implies that there is a function t : [a, b] M 2 with 
(*) t\s) = k(s) . (-t 2 (^),ti(^)). 

We can choose t ⑷ arbitrarily; choose it to be a unit vector. Now 

(t 1 2 +t 2 2 ) / = 2t 1 t 1 / +2t 2 t 2 / 

= 2 ((ti, t 2 ), (V ， t 2 ’)> 

= 2((ti, t2),/C"(—12, ti)) 

= 0 . 

So t(^) is a unit vector for all s. There is, again by Theorem 1.5-17，a curve 
c : [a, b] R 2 with 〆($) = t(^). Since t^) is always a unit vector, c is parame¬ 
terized by arclength. Equation (*) then says that t’(s) = - n(^), so that k(s) 

is the curvature of c at s. 

If c and c have the same curvature functions then their unit tangent vec¬ 
tors t and t both satisfy (*)• Now if t is any solution of (*)，clearly Bot is also, for 
any rotation B. Choosing B so that B{t{a)) = t(a), and using the uniqueness 

of solutions of (*) with a given initial condition, we see that t = B o t. This 
implies that c differs from 5 o c by a translation. ♦> 
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Notice that this theorem makes the previous calculation unnecessary: Since a 
circle of radius R has constant curvature \/R, any curve with constant curvature 
1 /R differs from this circle by a Euclidean motion，and is consequently another 
circle of radius \/R. More generally，Theorem 2 seems to make further study 
of curves almost pointless. Although one may still study properties of curvature, 
there is clearly no point in introducing any similar concept; we would only be 
interested in concepts that remained the same for c and A o c, and all of these 
are already determined by the curvature. 

Despite these remarks，we are by no means ready to write off the study of 
plane curves. Many interesting results remain, of which we will be able to 
sample only a few. However, these results were all proved many years after the 
study of curves had been initiated，and are all global results, rather than local 
ones. To begin, we define certain kinds of curves, with which we will be almost 
exclusively concerned. 

AC 1 curve c : [a,b] M 2 is called closed if c(a) = c(b) and c f (a) = c f {b). 


I >/ these are 

closed curves 




One can also regard a closed curve as an immersion of 5 1 in M 2 . A curve 
is called simple if it is one-one. Finally，among the simple closed curves we 
distinguish a special class of curves called convex. These are defined to be 
the simple closed curves which always lie on one side of their tangent lines. 



Although the property enunciated in this definition is precisely the one which 
is used in all proofs about convex curves, we will nevertheless take time out to 
equate this definition with a more common one. 
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Any subset A of M 2 is called convex if the line segment ~pq from to ^ is 



convex sets non-convex sets 


contained in A whenever p,q e A. Suppose A is convex and is a point in 
the boundary of A. A line L through p is called a support line of ^4 if ^4 lies 
completely in one of the closed half-spaces into which L divides R 2 . 






4 support lines 
of a convex set 


3. PROPOSITION. If A is convex, and p is in the boundary of A, then there 
is at least one support line L through p. 

PROOF. If A has no interior points it lies on a line, and the proof is trivial. 
If A has an interior point q, let / be the ray from q through /?, and let V be the 
part starting at p. Clearly V intersects A only at p, for if a point q ! on V were 



in A, then all points between q f and the points in a neighborhood of q would 
be in A, so p would be an interior point of A. 
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Choose one side of V, and consider angles 9 such that rays from p making 
an angle of 9 with V on this side do not intersect A except at p (it may be that 
0 = 0 is the only possibility). Let 9\ be the least upper bound of all such 9, and 
let be the ray through p making an angle of 9\. Let I 2 be the corresponding 
ray on the other side of V. 



We claim that the angle between l\ and I 2 is > it, which will surely prove the 
theorem. To prove the claim, note that there are points of A arbitrarily close 
to (or perhaps even on) both l\ and , 2 . If the angle between and ,2 were 〈丌， 




the triangle containing a suitable pair of such points, and q, would contain p 
in its interior, which cannot happen, since p is not an interior point of A. ❖ 

J^ote (for those who are familiar with Banach spaces) • This theorem is essentially the 
Hahn-Banach Theorem. If A were symmetric about the origin, then A would 
be the unit ball in IR 2 for a Banach space norm || ||. If VF C M 2 is the subspace 
spanned by p, and X : W — M is the linear functional with X(p) — 1, then the 
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desired support line is just a translate of the kernel of an extension X:R 2 

of k with || 又 || = 1. Symmetry of A is really unimportant, for the Hahn-Banach 
Theorem only requires a norm satisfying ||au|| = a||u|| for a > 0. The proof 
given here is just a geometrical translation of the main step in the usual proof 
of the Hahn-Banach Theorem. 

We now want to show that a simple closed curve c is convex if and only if the 
set A consisting of all points on c or inside c is a convex subset of M 2 . This is 
going to be pretty hard, since we have never defined the inside of a simple closed 
curve, and are just assuming that the content of Corollary LI 1-15 is intuitively 
obvious. There is really no need to go through the proof of all this right now; 
it is only necessary to accept the following fact: 

Suppose c is a simple closed curve, and / is a ray from p which inter¬ 
sects c at just one point q ^ p. Suppose, moreover, that the tangent 
line of c at ^ does not lie along /. Then p is inside c. 



4. PROPOSITION. Let c be a simple closed curve, and let A be the set of all 
points on or inside c. Then c is convex (that is, c lies on one side of each of its 
tangent lines) if and only if A is convex. 

PROOF. Suppose first that A is convex. Any point /? on c is a boundary point 
of A, so there is a support line L of A through p. This line is clearly the tangent 
line of c at /?, so c lies on one side of the tangent line through p. 

Now suppose c is convex. For each p on c, let H p be the closed half-plane, 
bounded by the tangent line through p, in which c lies. Clearly A C H p . 
Since the intersection H p of all H p is convex (any intersection of convex sets 
is convex), it suffices to show that we actually have a = d p h p . So consider a 
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point q which is outside A. Let p be the point on c closest to q. Clearly the 
tangent line L of c at is perpendicular to the ray / from q through p. It 



suffices to show that c lies on the opposite side of L from for then q ^ H p . 
Now if c lay entirely on the same side of L as q, then c could not intersect the 
ray / at any other point; for it cannot intersect the open segment ~pq, since p 
is the closest point on c to q, and it certainly could not intersect / at points on 
the other side of L. By the remark preceding the Proposition, this would mean 
that 《 is inside c，a contradiction. 


Our first global results about curves depend upon a corresponding global 
formulation of the curvature function for a curve c : [a,b] M 2 . As usual, we 

assume c is parameterized by arclength，and consider its associated unit tangent 
vector curve t : [a,b] S 1 . Any point in S 1 can be described as (cos 0， sin 0) 
for a real number 9. Of course, it is not possible to do this continuously. More 
precisely, if we define /x: 1R ^ 5 1 by 


= (cos 9, sin 9) E 5 1 , 



M 




C 




5 1 




then there is no continuous function /: S 1 R with [i o f = identity. On 
the other hand, for our curve c: [a,b] R 2 there is a continuous function 
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/ •• [a,b] M with 


= t(^). 



5丌 

T 


More generally, 


5. PROPOSITION. Let y : [a,b] ^ 5 1 be continuous. Then there is a con¬ 
tinuous function /: [a, 6] ^ M with /x o / — y. Moreover, if / and / are any 
two such functions, then / — / = 2nk for some k. 

PROOF. For any t e [a, b] there is an open connected subset I t of [a, b] con¬ 
taining t such that y(It) is a proper connected subset of S l . Clearly /x _1 (y(/ f )) 
is then a disjoint union of connected sets and /x restricted to each of them is a 
homeomorphism onto y(It). This shows that if we choose any number v with 

—4;r —lit 0 lit 4n 

_> j_ 里 ■_I_ 



/x(u) = y(t), then / can be defined uniquely on I t in such a way that it has the 
value v at t, is continuous, and satisfies /x o / = y on I t . 

We first prove that if there are two continuous functions / and / with /xo/ — 
V, then they must differ by 2nk for some k. It obviously suffices to prove that 
/ = / if f{a) — f(a). Let A be the set of all t e [a,b] such that f(t) = f(t). 
Then A is closed, while the previous paragraph shows that A is open. Since 
a e A and [a,b] is connected, this shows that A — [a,b]. 
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To prove existence, consider the set B of all ^ e [a,b] such that a continuous / 
can be defined on [a, ^], and let ,o be the least upper bound of B. There is 
t\ e B with t\ < to and t\ G I t{) . Define a continuous / on I to (with any value 

u at ,o satisfying fi{v) = y(,o)). Then f (t\) — f(t\) — 2kn for some k. Now 
/ — 2kn must equal / on [,i ， ,o]，by the uniqueness proved previously. So we 

can extend / to be f — 2kn on [0 ， ,o]. This shows that ,o e 召 • If ,o 〈办 ， then 
we obtain an immediate contradiction by extending / to U I t{y 

If / is the function given by Proposition 5 for the curve t: [a, 6] ^ 5 1 , so 
that 

c\s) = t(^) = M(/(^)) = (cos f(s), sin f(s)), 

then 

c f \s) = (—/’ ⑴ sin/Q )， f(s) cos f(s)). 

So 


K(s) = C\{s)c 2 tf {s) - c 2 \s)c\ tf {s) 

=COS /( 小[(/⑷ COS f(s)] - sin /( 小 [—/ 々 ）sin / ⑴] • 


We thus have 

k(s) = 


Notice that this gives us an easy way to reconstruct the curve from its curvature 


function: We first reconstruct / as f(s) 


k( 9) d9\ this gives us t^) 


/x(/(s))，so one more integration gives us the curve. Notice also that 


m — f(a) = 



f\s)ds 



this quantity is called the total curvature of c. If c is a closed curve，the function 
t : [a,6] ^ 5 1 satisfies t(a) = t(6), so we may regard it as a map t; 5 1 ^ 5 1 . 
The total curvature then has a special interpretation. 


6. PROPOSITION. The total curvature of a closed curv r e c: [a, b] M 2 is 
2 兀 times the degree of the map t: S l S l . (The degree of a map is defined 
on pg. 1.275.) 
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PROOF. Since the form W on S 1 has integral 2 兀 ， the degree of t is 


、b 


b 


lit 




(jMo ff{d9) 


a 


2tc 


a 


、b 


•氺 


In 




a 


Now 




dt 


(-sin9,cos9)^0) ^ S l ^ey 


this is a unit tangent vector of S l , on which d9 has the value 1. So 


jut*(d9) = dt. 


Thus 


b 


degree of t 


In 


rm 


a 


、b 


In 


! dt 


a 


2n 


[f(b) - /(«)]•❖ 


The degree of t is also called the rotation index of c. The left-most figure 
below illustrates the first of our global theorems. 



f close to 2 tt 


rotation index 1 


rotation index 2 rotation index 0 
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7. THEOREM (THE HOPF UMLAUFSATZ). The rotation index of a sim¬ 
ple closed curve is 土 1 (depending on the direction in which it is traversed). 

PROOF. Let c : [0, L] M 2 be the curve, parameterized by arclength, and let 
A C M 2 be 

L 

A = {(^ 1 ,^ 2 ) ： 0 < < ^2 < c. 


A L 



We define 0: A ^ 5 1 by 


<t>(S U S2)= 


咖） 一 C ⑻ 
| 咖）一 c(s { )\ 


♦ = c\s) = t(^) 

0(0, L) = —c^O) = —1(0). 


^1 < S 2 and (A, # (0, L) 


It is easy to see that 0 is continuous. Now the map s ^>{s, s) isjust t : [0, L] 

S l . On the other hand, this map is homotopic to the map y obtained by 
applying (p to the curve which goes along the other two sides of the triangle, 
from (0,0) to (L, L). So it suffices to compute the degree of y, which we break 



into two pieces and y 2 , defined on [0, L] and [L, 2L], say. 

The rotation index of c clearly does not change if we rotate or translate c, 
so we can assume that c lies in the upper half-plane, with c(0) = (0,0), and 
that the tangent line at c(0) is the x-axis, as in the picture at the top of the next 
page. We assume that c is traversed in such a way that t(0) = (1,0). Now, 
Ki(^) = 0(0, clearly always lies in the semi-circle in the upper half-plane, 
and h(0) = t(0), while Y\{L) — — 1(0). Consequently, the function / given 
by Proposition 5 clearly has its image in [0,7r]，with /(0) = 0 and f(L) = it. 
Similarly, y 2 lies in the lower half-plane, so the / for y 2 satisfies f(L) = n and 
f(2L) = 2n, Thus the degree of t is 1 /2n - [2n — 0] = 1♦ 
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Using Theorem 7, we can now relate convexity and curvature. 


8. THEOREM. A simple closed curve c is convex if and only if its curvature k 
satisfies k >0 or k <0 (depending on the direction in which c is traversed). 

PROOF. Let c: [0, L] -> R 2 be parameterized by arclength, and choose a 
continuous /: [0, L] -> M with /x o / = t. 

If /c > 0, then f f > 0, so / is non-decreasing. Suppose that c were not 
convex, so that c lies on both sides of the tangent line / through some point p. 
There are points q\,qi on c which are furthest away from / on both sides of /. 



The tangent lines at q\ and qj are clearly parallel to /, so of the three unit 
tangents at these points, at least two are identical, say t^i) = t(^ 2 ) for < ^ 2 , 
with ^ (0, L). Thus fisj) — f{s\) is a multiple of In. But / is non¬ 

decreasing, and f{L) = / ⑼ + In, by Theorem 7. So either f (S2) = f(s\) or 
fisj) — f Cyi)+2;r. In the first case it follows that / is constant on [^ 1 , ^ 2 ], so t is 
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constant on pi，&]. This implies that c is a straight line on [A，&]，contradicting 
the fact that none of the points p,q\ y qi lies on the tangent lines through the 
others. If /(&) = /(^i) + 2n, the other arc of c must similarly be a straight 
line，which is again a contradiction. Thus c must be convex. 

Now suppose c is convex. If there is < ^2 with f(s\) = fisj), so that 
t(s\) = t(S2 )， then there is also s with t(s) = — t(si )， since t has degree 1， and 



is consequently onto 5 1 . The tangent lines through two of the three points 
c(^i), c(^ 2 ) must coincide (otherwise c would cross one of them). Thus c is 
tangent to the same line / at two points, c(^i) and c(^), say. These two points 
divide c into two arcs, a and P ， If V is perpendicular to /， and intersects / 



between c(t\) and then a and must intersect using convexity of c, 

it is easy to see that a and P intersect V exactly once. Clearly one arc，say a, 
always intersects V at a point further away from l than the other, p. We claim 
that P lies along l. If not, consider the tangent line through the point P of P 
furthest from /. This tangent line is parallel to /， and c would lie on both sides 
of it, a contradiction. 
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Now, since 卢 lies along /, we have t(^) = t(^). Thus t\ — and tj = sj. 
Since the curve lies along a line on we have f(s) = f(s\) for all s e 

Thus, / is non-decreasing. 

Our final global result about curves in the plane involves another local con¬ 
cept. A vertex of a curve c is a point where r’(s) = 0. It is easily seen that an 
ellipse which is not a circle has exactly four vertices, at the ends of the major 
and minor axes; these are the points where k has a local maximum or minimum 
(though a vertex need not generally be of this type). Before proceeding with the 
next theorem we need a preliminary observation. The curvature k is defined 
by 

t’O) = k{s) .nO); 

the existence of such a number k(s) follows from the equation (t,t) =： 1, by 
differentiation. Similarly, from (n,n) = 1 we obtain 

<n’0) ， n> =0 ， 

which implies that n'G) is a multiple of t(^), say = a - 1(^). On the other 
hand, from (t, n) = 0 we obtain 

<t’ ⑴ ， 11 (外 + <tO) ， n’G)> = 0 ， 

or 

k(s) + a = 0; 

hence, 

(*) n’C?) = —k(s) - 1. 

9. THEOREM (THE FOUR VERTEX THEOREM). Every simple closed 
convex curve has at least four vertices. 

PROOF. If c: [0, L] -> R 2 is the simple closed curve, parameterized by arc- 
length, then c has at least two vertices — namely, the maximum and minimum 
points for the curvature. Choose the coordinate system so that the x-axis passes 
through these two points. Now integration by parts gives the following equation, 


e2 
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in which we are taking integrals of M 2 -valued functions by integrating each 
component separately: 


f K f (s) - c(^) — — f k(s) - 1 (^) ds 

Jo Jo 


=I ds by (*) 

Jo 

= n(L) — n(0) = 0. 


Consequently, we certainly have 

f fc / (s)(c(s),e 2 ) ds = 0. 
o 

If there are no other vertices, then 〆 > 0 on one half of c and 〆 < 0 on the 
other. So K f (s){c(s), ei) has the some sign on both halves, contradicting (**)• 
Thus c must have at least one more vertex. 

The argument just given actually shows that c cannot be formed of two arcs 
with 〆 > 0 on one and 〆 < 0 on the other; the same conclusion clearly holds 
even if > 0 on one and /<r / < 0 on the other, since K f ^ 0 somewhere. This 
shows that c must have a fourth vertex; if it had only three, then some pair 
would divide c into two arcs with k > 0 on one and 〆 f 0 on the other. 


氺氺 


We now turn our attention to curves in space, and ask, once again, how to 
measure the curvature of c: [a,b] -> R 3 . We can still look for the limiting 
position of circles through c(^i), c(^ 2 ) ? ^(^ 3 ) as ^ If this limiting 

circle exists, its center C must satisfy 

(c\s), c(s) -C) = 0 

{c /f (s), c(s) -C) = -{c(s),c f (s)); 

the derivation of this necessary condition still works. However, for space curves 
these equations do not even determine C; the first equation merely restricts C 
to lie on a certain plane, not on a certain line. We must first see whether the 
planes through the points approach a limiting position. 


10 . PROPOSITION. Let c: [a, b] -> R 3 be a C 2 curve parameterized by arc- 
length, with ^ 0. For 心， $ 2 , M sufficiently close to s, the points c(^i), c(s 2 ), 
c(s^) do not lie on a line. As Si s, the unique plane through the points c(si) 
approaches the plane P spanned by c’( 5 *) and c f/ (s). 
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Remark: The plane P should really be described as the plane through c^) which 
is parallel to the plane spanned by 〆 ⑴ and c ff (s)^ but we will allow ourselves 
the elliptical terminology suggested by the picture. 



PROOF. Assuming for the moment that the points c(^/) do not lie on a straight 
line, let P(^i,^ 2 ^ 3 ) be the plane spanned by these points, and let a(^ 1 ,^ 2 ,^ 3 ) 
be a unit vector perpendicular to P(^i, ^ 2 , ^ 3 ). Then the function 

(*) ^ »-> (c(s), a(s u s 2 ,s 3 )) 

is 0 for s = Si. So we have 

(1) 〈 〆(&•)，= 0 g OfW+O ， / = 1 ， 2. 

It follows that 

(2) 〈 〆’(") ， aOi ， h ， s 3 )> = 0 r] e ($ 1 ,^ 2 ). 

Equations (1) and (2), together with continuity of d and c n ， clearly show that 
a{s\^S 2 ^ ^ 3 ) approaches a unit vector perpendicular to c’O and c ,f {s)^ so that 
P(^ 1 ,^ 2 ,^ 3 ) approaches P. 

If the points c^/) do lie on a straight line, then we can choose a whole circle 
of unit vectors a{s\ , S 2 , ^ 3 ) for which the function (*) vanishes at s 〜 If this 
were true for Si arbitrarily close to s, the remaining part of the argument would 
imply that all of these vectors are nearly perpendicular to P, which is absurd. ^ 

The plane described in Proposition 10 is called the osculating plane of the 
curve at s. Notice that, unlike the osculating circle, the osculating plane may 
exist even if c /f (s) = 0. For example, if c is a plane curve which is not straight, 
then the osculating plane certainly exists. The exact conditions for the existence 
of an osculating plane are not very important for us, but we will pause to indicate 
the actual state of affairs. 
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The worst possible situation occurs for the curve 

(0 t ^ 0 

c(t) = < t > 0 

少， o， 1 " 2 ) t < 0, 

which can’t make up its mind whether to osculate in the (x, j)-plane or in the 
(x, z)-plane. For this curve we have c (幻 (0) = 0 for all k > 2, which suggests 
that we consider only curves with c (幻⑴ # 0 for some k > 2. Notice that for 
curves parameterized by arclength,〈 c’ ⑴， = 0 implies 

{c f (s),c m (s)) + {c ff (s),c /f (s)) = 0 . 

So if c ,f (s) — 0, we have c ,,f (s) perpendicular to Similarly, c ,f, (s) = 0 

implies that e( 4 )Cy) is perpendicular to c f {s), etc. So the first non-zero higher 
derivative c (灸 ）(s) is the same as the first derivative which is linearly independent 
of Now suppose P is the plane spanned by c’ （ s) and this first non-zero 

Let ^3 be parameter values on the same side of s, i.e., ^ < 

^2 < ^3 (or < ^2 < ^3 ^ If a(^i,^ 2 ^ 3 ) is a unit vector perpendicular to 
all —小)， then, as before, we have 

〈c’fe)，a(^i,^2,^3)) = 0 

〈 〆’(")，= 0. 

Now r] g (^ 1 , ^ 3 ) so rj ^ s. Hence, if c ,f (s) = 0 we have 0 g (^, r]) with 

〈c"’(0)，a(^i,^2,^3)) = 0. 

Continuing in this way, we finally obtain k e (s, ^ 3 ) with 

〈c w ( 入)， = 0 . 

As before，this shows that the plane through the points c(^) approaches P, and 
that the points c (心） cannot lie on a line for Si arbitrarily close to 0 . 

If the Si are allowed to lie on both sides of s, this result is no longer true. For 
example, consider the curve 



c(t) = (t,t A ,t 3 ). 
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We have 

〆⑼ = (1，0,0) 

c" ⑼ = (0,0,0) 

，(0) = (0,0,6)， 

so (0,1,0) is perpendicular to the plane spanned by c’(0) and <?"’(())• On the 
other hand，for a vector perpendicular to the plane spanned by c(0), c(0, c{—t) 
we can choose 

normalized [c(0 — c(0)] x [c(—0 — c(0)] 

=normalized (/,/ 4 ,/ 3 ) x (—/,/ 4 , —t 3 ) 

= normalized (— 2 / 7 , 0, 2t 5 ) 

-> (0,0,1). 

This strange behavior is clarified by a look at the Taylor expansion 

^3 

c(^) = c(0) + sc ’ ⑼ + 0 + — 〆 " ⑼ + o(^ 3 ), 

6 

which shows that 

[c(s) - c(0)] X [c{t) - c(0)] = - l ^r c f (0) X c"’(0) 

o o 

+ higher order terms. 

When s and t have the same sign, the dominant term is the first. But when s 
and t have opposite signs this is no longer true—this term may even be 0. 
I suspect, but have not checked, that the parameter values Si may be picked on 
both sides of 0 if and only if the first k > 2 with c(々)(0) # 0 is even. 

For space curves c: [a,b] M 3 with c”{s 、^ 0, we now clearly have an 
osculating circle, the limit as 心,&， M s of the circle through the points c(^)； 
it lies in the osculating plane. We define the curvature k to be the reciprocal of 
the radius of this circle, so that 

This definition, once again, determines a curvature even when 〆’⑴ =t’ ⑴ = 
0. Unlike the case of plane curves, we cannot obtain a signed curvature, for 
there is no natural way to pick a vector orthogonal to t(^). However, when 
k(s) ^ 0, we can now define n ⑴ by the equation 

= ^(^) 11 (^), n(s) = normalized 
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The vector n ⑴ is called the principal normal of c at We also define the 

binormal b ⑷ by 

b(^) = t ⑴ x n ⑴， 

so that (t ， n ， b) is always a positively oriented orthonormal basis. 



Note that (b,b) — 1 implies that (b’，b〉= 0, so that b' is a linear combination 
of t and n. We also have (b，t〉= 0, which implies that 

(b’，t> = -(b^^ = -(b,n) = 0. 

Thus b f is actually a multiple of n, and we can define a new function r，the 

torsion, by 

b’ = —rn. 

Of course, we can define r only at points where n exists，i.e.，where c n ^ 0. 
This is analogous to the fact that k can be defined only at points where d ^ 0. 
The reason for choosing the negative sign in this equation will be explained in a 
moment; we first interpret the absolute value |r|. The function b: [a,b] S 2 
has an arclength function 

length of b on [a,.?] = / |^(«)| du 

Ja 

=I \r(u)\ du. 

Ja 

Consequently, \r(s)\ is the derivative of this arclength function. Since b is the 
perpendicular to the osculating plane, this derivative of the length of b can be 
thought of as the rate at which the osculating plane is changing. Thus |r| measures, 
in some sense，the rate at which the curve deviates from being a plane curve. 
Classically, curvature and torsion were also known as first and second curvature, 
and space curves were called curves of double curvature. 
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To develop a formula for the torsion, we first recall that the cross product 
u x w of u and w is defined by the equation 

(*) (z, u x w) = det I v 

\ w 

[Since the / th component of u x w is (e/, u x w), this shows that v x w can be 
obtained by computing the determinant 


e\ 

^2 



V2 

u 3 

W\ 

W2 

W3 




(v 2 W3 - w 2 V3)ei + 


purely formally.] Using the fact that det is alternating, the left side of (*)，clas¬ 
sically known as the “scalar triple product” (u, w, z), is seen to satisfy 

〈z ，v x w) = —(w, v x z) = {w, z x v). 


氺氺 


Now for the torsion r of a curve c parameterized by arclength we clearly 
have 

r = (-n,b') = (-n, (t x n)') 

=〈一n，t x n’〉+〈一n，t’ x n〉， where t^ x n = 0 


c" 

K 

,〃 


C X 


( 


c 


ff\f 


K 



Jtt 


C f ( KC 

—— ， f X I - 

K \ K 

1 

ff .J .. .Jft 


K f C 


ft 


2 


K 


2 


{c X c ), 


or 



This shows that r > 0 when t, n, c fff form a positively oriented basis for M 3 , 
which is the same as saying that c m is on the same side of the osculating plane 
as b. Now in Taylor’s formula, 

h 2 h 3 

c(s -\-h)= 中） + hc(s) + + —c m (s) + o(h\ 

2 6 

the term c*^) + hc f (s) + h 2 c ff (s)/l is in the osculating plane at s, so 


c(s + /z)— 


h 2 


c(s) + hc f (s) + 


h 3 


c 


m 


⑴ + 咖 3 ) 


L ^ J ^ 

points from the osculating plane to c(s + h). Consequently, if /"( 5 1 ) — 0, then 
the curve pierces the osculating plane at s, and the points + h) for small 
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h > 0 are on the same side as c’〃(x)，while points c(s + /z) for /z < 0 are on the 
other side. Together with our previous remarks，this shows that 

if r( 5 *) > 0, then points c(5* + h) for small /z > 0 lie on the same side 
of the osculating plane as b( l s , ), while points c{s + h) for small h < 0 



Our formula for r shows that the torsion for the curve c = s \-^ c(—5 1 ) has 
the same sign as that of c. This is because reversing directions also reverses the 
binormal. For a curve with an arbitrary parameterization we obtain, proceeding 
just as in the case of curvature, the formula 


• • 奢警 ••• » 

(c X c, c) 

r = - 

(c X C,C X c) 


A standard and possibly illuminating way of examining the geometrical sig¬ 
nificance of k and r is to examine the projections of c on the planes spanned 
by any two of t, n, b. The plane spanned by t and n is just the osculating plane. 



The plane spanned by the principal normal n and binormal b is called, naturally 
enough, the normal plane, and the plane spanned by t and b is called the 
rectifying plane (this terminology is explained in Volume III, pg. 186). We can 
choose a coordinate system for M 3 so that c(0) =0 and so that the osculating 
plane of c at 0 is the (x, j)-plane. Further choosing c’（0) = (1,0,0) we obtain 

c(0) = (0,0,0) 
c 7 (0) = (1,0,0) 
c’'0) = (0，，0) 
c 〃 ’( 0 ) = ( — ， — ,kt), 
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the last equation following from the fact that tk 2 = (c f x 〆’ ， 〆"〉，as shown 
above. The three components of the Taylor expansion 

2 3 

c(s) = c(0) + K’(o) + ^V’(O) + ‘〆 "⑼ + … 

2 6 

give 


C\ (5 1 ) = s + terms of order 3 or more 
k 2 

— —s + terms of order 3 or more 

KT ^ 

二一 s + terms of order 4 or more. 


So the projections look like 

y — -x 2 up to order 2 on the osculating plane 


z = —jc 3 up to order 3 on the rectifying plane 


2 lx 3 

z = - y up to order 3 on the normal plane. 



The figure below shows these projections for the curve on page 30. 
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Just as in the case of plane curves, we now ask to what extent the curvature 
and torsion determine a curve. 

We note first that a curve c (parameterized by arclength) with a: = 0 every- 
where is a straight line — the proof is the same as before. 

Moreover, a curve with r = 0 everywhere is a plane curve. To prove this, we 
note that r = 0 means b 7 = 0 so that b(.y) = bo, a constant vector. This implies 
that (t, bo) = 0. But this means that 

—(c^), bo) = 0, 
as 

so (c(s),b 0 ) — Cl where a is a constant. Thus c lies in the plane 

{/? e M 3 : (/?,b 0 ) = a). 

Unlike the case of plane curves, we should not expect a curve with constant 
curvature to be a circle, unless the torsion is 0. To get some idea of the variety of 
possibilities, we will examine only one special class of curves, the helices, given 
by the formula 

c(u) = (a cos w，a sin u, bu). 

2nb 

We have 

〆⑻ — (—a sin w, a cos u, bu), 

so _ 

\c r {u)\ = \J a 1 -\-b 2 = £>， 

and the reparameterization y by arclength is given by 

y(s) = c(s/D), 



with 
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Thus 

= \y f, (s)\ = 

(y’ x 〆’ ， 〆 "〉 b 
T= ^ 

Notice that neither k nor r depend on the sign of a. Changing a to —a merely 
rotates the helix through an angle of '丌 around its axis, which is the same as 
moving it a certain distance in the direction of this axis. However，changing 
from b to —b changes the helix from “right handed” to “left handed” or vice- 
versa, and accordingly changes the sign of r. 

a to —a b to —b 




By choosing suitable a and b, we can make \a\/D 2 and b/D 2 equal to any de¬ 
sired pair (a:, t) with a: > 0. So helices give examples of curves with any desired 
constant curvature (> 0) and constant torsion. Are they the only such curves? 
Rather than imitating the calculations for the simpler question answered pre¬ 
viously, we will immediately ask the more general question, whether k and r 
determine c up to a proper Euclidean motion. 

We begin with a recapitulation of the definitions: 

t’ = Kn 
b f = —in. 

Notice that we have expressed the derivatives of t and b in terms of the original 
vectors t ， n ， b. We can do the same for n. First, since (n,n) = 1, we obtain 
〈 n’ ， n 〉 = 0, so n’ is some linear combination of t and b. Now, from (n,t) = 0 
we obtain 

(n , t) ― _ (n, t ) — — (n, /cn〉= — k , 

(we already obtained this equation for the case of plane curves); and from 
(n,b) = 0 we obtain 


〈 n’ ， b〉= — 〈 n ， b’> = — 〈 n ， -rn) — r. 
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Thus we have ， altogether ， 


t f = 

KU 

n f = 

—Kt + rb 

b f = 

— rn 


These are called the Serret-Frenet formulas. They were obtained independently 
by Serret in 1851 ， and by Frenet, in his thesis of 1847 ， an abstract of which 
appeared in 1852 . Before Serret and Frenet, many geometric properties of 
curves were investigated with great laboriousness, but afterwards many of these 
investigations became routine, because, as our next theorem shows, everything 
about space curves is contained in these formulas. 

11. THEOREM. Let k, t : [a,b] M be continuous, with k > 0 on [a, b\. 
Then there is a curve c : [a,b] M 3 , parameterized by arclength, whose cur¬ 

vature and torsion functions are k and r. Any two such curves differ by a proper 
Euclidean motion (a translation followed by a rotation [an element of SO(3)]). 

PROOF. Let us adopt the more systematic notation Vi,V 2 ,V 3 for t,n,b, and 
define a matrix 

广 0 -K{s) 0 、 

^ 7 (^) =1 k{s) 0 -r ⑴， 

\ o r 。） 0 / 

so that the Serret-Frenet equations become 

3 

(*) V = 

Now Theorem 1.5-17 implies that there is a function s \-^ (yi^),V 2 (^),>" 3 (^)) 
on [a, b] satisfying (*)• We can choose Vi(a) arbitrarily; choose them to be 
orthonormal and positively oriented. We claim that h ⑴ are orthonormal for 
all s e [a, b]. This is the only significant point in the proof; the rest of the proof 
is exactly like the proof of Theorem 2. 

To prove that the V/ are always orthonormal, we note that 

3 

= ^2a ki {\j,\ k ) -\-a kj {yi,y k ). 
k~l 
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This shows that the functions P" = (v/, vy> satisfy the differential equation 

3 

(**) Pij — 〉 ： a kiPjk + a kj Pik ， 

k=\ 

together with the initial conditions Pij(a) = 8ij. Since the solutions of (**) are 
determined by their initial conditions, we can prove Y/ everywhere orthonormal 
by showing that the functions Pij(s) = 8ij do satisfy (**)• In other words, we 
want to show that 

3 

^ = ^ij = 〉 \ a ki 各 jk + a kj^ik 
k~\ 

—Clji + (lij. 

But this is true—the matrix (a") is skew-symmetric. ^ 

The skew-symmetry of (a") gives an easy way to remember the Serret-Frenet 
formulas: the formulas for t f and V are by definition, and the formula for n' is 
the one that makes the matrix skew-symmetric. If we think of t ， n，b as column 
vectors, so that (t ， n ， b) denotes a 3 x 3 matrix, then the Serret-Frenet equations 
read 

(U’ n〆 bi \ / ti ni bi \ / 0 -k 0 \ 

(t ， n ， b)’ = t 2 ’ n/ b/ j = 1 t 2 n 2 b 2 j 卜 0 

\W n 3 ' b 3 / \t 3 n 3 b 3 / \0 r 0 / 

Since t ， n, b are orthonormal and positively oriented, the curve = 

(t^), n( 1 s'),b(5')) is a curve in SO(3)，the identity component of the orthogo¬ 
nal group 0(3). Recall that the tangent space SO(n)r = O(w)/ = o(«) of 
SO(«) at / is the set of skew-symmetric matrices. For A e SO(«)，the tangent 
space SO(n)A is just ⑻)， which equals La • o(n), since La is a linear 

function. Thus SO(«)^ consists of all matrices A - M for M e o(n). Clearly, 
A ' M — La^(M) is just M(A), where M denotes the left invariant vector field 
with value M at /. Now the curve a : \a,b\ SO(3) must have its tangent 

vector in SO(3) a (_y)，so we must have 

• (skew-symmetric matrix). 

As we have just seen, this skew-symmetric matrix is just 

( 0 -K(s) 0 \ 

咖） 0 -T ⑷. 

V 0 r ⑺ 0 / 
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Moreover，this argument shows that skew-symmetry of this matrix is a necessary 
consequence of the fact that t ， n， b are orthonormal. By the same token，we can 
now present a more illuminating proof that the equations 

3 

(*) V = 

y=i 

in the proof of Theorem 11 have an everywhere orthonormal solution. The 
equation (*) for Qf^) = 12 ( 5 -), 13 ( 5 -)) says that 

(/(5 1 ) = of (5 1 ) - a(5-). 

This may simply be regarded as a differential equation on the manifold SO (3)，of 
the type considered in the Addendum to Chapter 5， so its solution is a curve 
in SO(3). 

The Lie group SO(3) is playing yet another, hitherto unmentioned, role in 

Theorem 11. The fact that k and r determine c up to a proper Euclidean 

motion is equivalent to the fact that k and r determine a = (t ， n ， b) up to an 

element of SO(3), since c is determined up to a translation by t. In other words， 

_ • 

if c is another curve with corresponding a = (t ， fi ， b )， and ic = k, r = then 
for some A g SO(3) we have 

(t ， S， b) = Lji o (t, n, b )， i.e.， 01 = o a. 

Now we already have a theorem telling us when the relation a = La 0 ct holds 
between two maps a, a : [a,b] SO(3). According to Theorem 1.10-18, this is 

the case if and only if for every left invariant 1-form co on SO(3). 

So k and r must have something to do with these left invariant 1-forms on SO(3). 
In order to see what is going on here，we begin with a review of some facts about 
Lie groups. 

In Chapter 1.10 we defined the natural (^-valued 1-form ⑴ on a Lie group G 

by a){a){X{a)) = X, where X is the left invariant vector field with X(e) = X. 
Thus <jl> is the unique left invariant c^-valued 1-form such that (o(e) \ G e ^ Q,= 
G e is the identity. If X\,..., X n is a basis of g，then we can write 

n 

(o = 〉 : (o^ - X( 

/ =1 

for certain ordinary left invariant 1-forms a ) 1 . This equation means that for any 
tangent vector Y a e G a we have 

n 

a){a){Y a ) = - Xt e 
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the dot denoting multiplication of X( e by the real number (o l (a)(Y a ). Clearly 

• • 

the o) 1 are a basis for the left invariant 1 -forms; in fact, the (o l (e) are the dual 
basis to Xi， …， X n . So the natural g-valued 1 -form co has all the left invariant 
1 -forms built into it. 

Now for Lie groups G which are subgroups of some GL (万， M) we have an 
explicit way of finding ⑴， and hence a basis of left invariant 1 -forms. Let P 
(for “point”）denote the inclusion map of G C GL(«, M) C M w2 into R n2 . Then 
P : G is an R n -valued function on G. Hence dP is an R n -valued 

1 -form on G. We can also think of dP as a matrix of ordinary 1 -forms on G. 
This matrix is just 

dP — 、 dx” 

• • • • 

except that dx lJ here denotes the differential of x lJ \G. Notice that dP takes a 

tangent vector in G/，i.e.， 2 o\ n x n matrix M，into itself，so dP corresponds to 
• . , . . , 2 
the identity map of Gj into itself. For any A e G, the map P o : G ^ 

is 

P o L^i(B) = A • B 7 

from which it is easy to see that 

(1) d(P oL a ) = A^ dP. 

On G we also have the C°° function B \-^ B -1 ， which we will denote (somewhat 
confusingly, perhaps) by P~ l . For A e G the map P~ l o : G ^ is 

(2) P~ l o L A (B) = (A . B)~ l = B~ l • A~ l = (P— 1 • 

Finally, P~ l - dP is a matrix of 1 -forms (or an -valued 1 -form). From (1) 
and ( 2 ) we have 

L A ^P~ l dP) = (P_ l 。 L A ). L A \dP) 

= o L A ) d{L/P) 

— (P— l °LA).d(PoLA) 

=(P~ l - A~ l ) • A • dP 

二 P- 、 dP ， 

so P~ l - dP is left invariant. Moreover, for any M g G/ we have 

P~ l • dP(I)(M) = I~ l - dP{I){M) - M. 

Hence P~ l - dP is the natural ^-valued \-form (o on G. 
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These considerations allow us to determine but it must be remembered 
that，unless G = GL(/i,M), the forms dx l ) are not linearly independent on G. 
Problem 1.10-24 determines co for G = GL(n, R), and 1.10-23 determines (o for 
the group G C GL(2, M) consisting of all matrices 



« 7 ^ 0 ; 


for this group, x n and x 12 can be taken as a coordinate system, and a> can be 
expressed in terms of x 11 and x n . The more general situation is illustrated 
by Problem 1.10-25, which we shall repeat here. The special linear group 
SL(«,M) C GL(«,M) is the set of all matrices of determinant 1. Its Lie al¬ 
gebra, ^l(n, M), consists of all matrices with trace 0. We can prove this from the 
fact that 

det exp M — e traceM , 

in the same way that we found o(n); the formula just given is proved in Prob¬ 
lem 1.10-15. Now we see that, if x u , x 12 , x 2 \ x 22 are denoted by x, then 

for SL(2, M) we have 


P~ l dP = 






v dx — y du 
—u dx + x du 


vdy — y dv 
-udy-^x dv 



Remember that x, y, u, v are not linearly independent, for the dimension of 
^1(2, M), and hence of SL(2, M), is clearly 3. In fact, we know that xv — yu = \ 
on SL(2, M), which shows that 

0 = d(xv — yu) = x dv v dx — y du — u dy 

on SL(2, M), i.e.，the right side gives 0 when applied to a matrix M e SL(2, M)^ 
for any A G SL(2, M). Looking at the formula for P _1 • dP, this shows that 
P~ l - dP takes such a matrix M to a matrix of trace 0, as it must. 

We will now use this circle of ideas to re derive our results about curves, in a 
more systematic, if less geometric, way. In order to clarify the general nature 
of the results, we begin once again with curves in the plane, but we now seek 
the answer to a different classification problem. We already know that if ^4 is a 
proper Euclidean motion, and c : [a,b] M 2 is any curve, then 

(a) ^4 o c is parameterized by arclength whenever c is; we express this fact by 
saying that arclength is a “natural parameter for curves under the group 
of proper Euclidean motions”. 
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(b) For curves c parameterized by arclength, the curvature function is in¬ 
variant under proper Euclidean motions, i.e., the function k for A o c 
equals the function k for c, whenever ^4 is a proper Euclidean motion. 
Moreover, the curvature is “a complete set of invariants for curves pa¬ 
rameterized by arclength”： if i for c equals k for c, then c — A o c for 
some proper Euclidean motion A. 

We now ask for similar results when A is allowed to be any “special affine 
motion” [a translation followed by any member of SL(2, M)]. 

Consider a curve c： [a, b] M 2 for which c f and c" are always linearly 
independent. Then det(c ; , c ff ) # 0 at all points. For simplicity we will assume 
that det(c / ,c // ) > 0, to avoid writing absolute value signs in various formulas. 
Our first task is to determine a curve a c - [a, 6] ^ SL(2, M) analogous to the 
curve (t，n) •• [a,b] SO (2). Remember that t is just c ! when c is parameterized 
by arclength, while n is basically just chosen so that we will have (t,n) e SO(2). 

For the case of the larger group SL(2, R) the choice of a c should be easier. 
In fact, the choice 

a c (t) = {c\t\c\t)) 

will work if c"(,)) = 1. So we ask if there is a reparameterization of c 

with this property. In other words, is there a function 

a : [a,b] [0, l] 

such that the reparameterization y = c o a~ x satisfies 

det(〆，〆’）=1. 

Since 

c — y o a 
c = a f - (y f o a) 

c” = (a’) 2 •(〆’ 。 a) + a" •(〆o a)， 

and thus 

det(c ; , c ff ) = det(a’ •（〆 。 a), (a’) 2 •(〆’ 。 a) + a〃 .（〆 。 a)) 

=(a’) 3 det(〆o a, 〆’ 。 a), 


we want 


det(c ,c ) = (a ). 

We can thus define a, the “special affine arclength” by 


a(r) = / Vdet(c’(w)，c’’(w)) du. 

a 
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When c satisfies det(c / , c ff ) = 1, we say that c is “parameterized by special 
affine arclength”, and the special affine arclength is a natural parameter for 
curves under the group of special affine motions. 

Now consider two curves c, c parameterized by special affine arclength. We 
will have c = Aoc for some special affine motion A — Bor, where B € SL(2, M) 
and r is a translation, precisely when 

ad(t) = B • a c (/), otd = Lb ° 


By Theorem 1.10-18, this is equivalent to the condition 


本 


(a/) = a c *((o l ) 


for every left invariant 1-form (o l on SL(2, M), and thus to the condition 


本 


㈣ = a c *((o) 


for the natural ^1(2, M)-valued from (o = P _1 - dP on SL(2,M). 

We are thus interested in calculating a c *(P— 1 • dP), which will be a matrix 
of 1 -forms on [a, b] 

_ dt _ dt \ 

_ dt _ dt y 

for simplicity we will write all the dt\ after the matrix. 

To calculate a c *(P _1 - dP) we can either use our formula for P _1 . dP and 
write 


a c \P~ l • dP) = ( V 

V —u 


v dx — y du v dy — y dv\ 


dx x du —udy + xdv) 


v oa c d{x o a c ) — 


參参争 


or, what amounts to the same thing, calculate 

- dP) — ot c ~^ - dot c , 


the entries of which are dt times 


[a c (r)] _1 -a c f (t). 
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We have 


Cl" 

Cl C2 f ) 3 


and using Kramer’s rule, together with the fact that 1 = det a ( 
we find that 


% _1 = ( 


-c\ f \ 
O’ / 


«c _1 乂 = ( 


Since 




ci" cr 

(?2 〃 C 2 m 


det(〆, c"): 


1 = det(c’ ， c ff ) = c\ C 2 — c{c\ 


and thus also 


O f tff t m 

= C\ C2 — C2 Cl , 


we obtain finally 


1 . a: 


0 W-W" 


H 


0 一 det(〆 ，， 〆": 

1 0 


Thus, curves parameterized by special affine arclength are determined, up to a 
special affine motion, by one function, the “special affine curvature” 


x = det(c\c rrr ). 

It is also possible to give a geometric interpretation of the curvature x, which 
we only briefly indicate. We first note that a curve c, parameterized by a, with 
constant curvature x satisfies 


[a c (o)]~ l - a c \(7) 


^c{o) = a c (a) • 


One solution of this matrix differential equation is 


otc{o) — exp {a 


0 —x 
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that is, 


⑻ = 









( COS y/Jca —y/x Sill yJ~KO 
y/x sin y/xa cos y/xa 


for ^ > 0, with a similar result involving hyperbolic trigonometric functions for 
^ < 0. For ^ = 0 we simply have 

The first column of these solutions give 
^ = 0 : 

c\o) = (l ， a) 

c(a) = constant -h (a ， a 2 /2)，a parabola 




^ > 0 : 

c{a) = (cos yfxo, v^sinv^a) 

c(a) = constant -h (1 /yfx sin ^/xa, —\/x cos y/xa), an ellipse 




^ < 0: 
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c(a) = constant + (V\x\ cosh \f\x\a, \/\x\ sinh y/\x\a), a hyperbola 



All other solutions : M —> SL(2, M) are special linear transformations times 
these, and hence are still conic sections. 

Now it turns out that as 0 \, 02 ,o^ cr, the parabola through c(ai), 0 ( 02 ), 
c(o^) approaches a given parabola, the osculating parabola, whose axis lies 
in the direction of c ff (s). And as a, the conic through the 

four points c(a/) approaches a given conic, the hyperosculating conic, whose 
curvature x is that of c at s. However, we will not prove these facts. 


We can now return to curves in space, and apply these ideas to classify curves 
under the group of proper Euclidean motions. With each curve c : [a,b] R 3 
we want to associate a curve a c : [a, b] SO(3). Assuming det(〆 ， c", c’"）> 0, 
the obvious choice is to let a c (0 be the result of applying the Gram-Schmidt 
orthonormalization process to these three vectors. Introducing the parameter¬ 
ization by (ordinary) arclength, this means that 


(Xc(s) = (c(s), 


|c " ⑴ I’ 


c ⑷、 


We now have 


0 


-k(s) 


0 


a c f (s) = a c (s) - I k(s) 0 -⑼ 


0 




0 


the 0 in position (3,1) comes about because we chose c / , c ff /\c ff \ as the first two 
columns of a c , while all other features of the matrix are accounted for by the fact 
that a c ~ l - a c f = a c *(P _1 - dP) is skew-symmetric, since a c : [a,b] SO(3). 
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Clearly, curves parameterized by arclength are determined up to proper Eu¬ 
clidean motions by k and t, and these functions are obviously just our old k 
and r. If we had bothered to compute a c ~ l - ol c ! for the original curve, not 
parameterized by arclength，then the entries in positions (2,1) and (3,2) would 
have given us the formulas for k and r，derived earlier，for curves with an arbi¬ 
trary parameterization. 

We can classify curves c: [a,b] R 71 under the group of proper Euclidean 

motions in a similar way. We assume first of all that det(c’，. • • ， c( rt )) > 0. To 
obtain a curve a c : [a,b] SO(«), we first introduce the parameterization by 

arclength. The first two columns Vi ? V2 of a c will be c\ c n j\c !! \. So the first 
column of a c ~ l - a c \ which expresses \\ f in terms of the v；, will be 

0 

kx 

0 

争 

0 

for k\ = Using skew-symmetry, a c ~ l - a c f looks like 

( 0 —k\ 0 

k\ 0 
0 0 

0 X 

Since V 3 , the third column of a c , is obtained by applying the Gram-Schmidt 
orthonormalization process to v! ， ” ， 〆"，it is clear that \2 will be a linear com¬ 
bination of \\ and V 3 , So the second column of 一 1 . a c f will be 

-k x 

0 

k 2 

0 



0 , 
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for some function k^. Thus 一 1 • a c f looks like 



Continuing in this way，we find that 



for n — l different “curvature functions’’ k\, … ， k n —\. These n — l functions 
classify c up to proper Euclidean motion. 

For this classification of curves in we have had to restrict our attention to 
curves with c f ， … ， c ⑻ everywhere linearly independent. If we have a curve c 
such that is linearly dependent on c ’，•••，<: (众一 ” along a whole interval 
[a ， fe]，then it is easy to see that on this interval c lies in some (k — ^-dimen¬ 
sional subspace of so that we actually have an easier classification problem 
on this interval. 

The difficulties arise when we try to piece together the information we obtain 
for the separate intervals, or if is linearly dependent on 〆 ， ••• ， c ( 灸 ― 1 ) at only 
isolated points (or at some sequence of points, etc.). For example，how can we 
hope to distinguish between the curve 
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and the curve 

ro r = o 

C(t) = \ , /t 2 

) t^O 

using functions like curvature and torsion? Of course, for analytic curves these 
difficulties cannot arise. If any small portion of one analytic curve differs by a 
proper Euclidean motion from a small portion of a second, then the two analytic 
curves themselves differ by this proper Euclidean motion. But for C°° curves, 
our restrictions are natural ones to make. 

Finally, we will use these ideas to classify curves c : [a,b] W 1 under the 
group of special affine motions of M w . We claim first that there is always a 
function o : [a,b] [0，/] such that the reparameterization y = c o a -1 satisfies 

det(〆 ， 〆’ ， ••• ， )/ ⑻ ） =1. 

In fact, since 

c = y o a 
c — o 1 • () / o a) 

c n — {a f ) 2 - (y ,f ◦ a) + ( ) • 〆 。 a 
c fff = (o 7 ) 3 •(〆"°a) + ( ) - y ff o o - 



we clearly need 

det(〆 ， … ， c {n) ) = a / • (a ， ) 2 … (a= (a / f (w+1)/2 , 
so we define the “special affine arclength” a in M w by 

o{t) — f ，，0,+I)/ ydet(r / , ..., ⑻). 

Ja 

Now if we consider only cur\ 7 es c: [a,b] with det(〆，•. • ， c( n )) = 1， then 

we can define a c : [a, b] SL(«, M) by 

a c — (c\ ... ，(: ( n ))， 
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where each is considered as a column vector. Solving the equation 

Oic ~ Off * Cl^ 

we see that the first column of a must be 



the second must be 



and so forth. The last column is arbitrary, except that the matrix a must have 
trace 0, so a c ~ l - a c f must be of the form 


for w — 1 “special affine curvature functions” , x n ~\\ they determine a 

curve, parameterized by affine arclength, up to special linear affine maps of M w . 

The special affine arclength o in W 1 can be introduced only when 

det(〆 ， ... ， c( n )) # 0. 

If along some interval the k vectors c f ， … ， c ⑻ are linearly independent, while 
the vectors 〆 ， ... ， e( 々）， c( 灸 +1 ) are linearly dependent, then it is easy to see that 
along this interval the curve c actually lies in some /:-dimensional plane in E w , 
and we can therefore introduce the special affine arclength for along this 
interval. 

The theory runs into troubles when there are isolated points at which 

det(c’ ， ... ， c ⑻ ）= 0. 
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In this respect it is similar to the ordinary theory of curves, but the situation 
is still different, because in the special affine case the function a which we 
choose as the special affine arclength will actually depend on the number k for 
which c’” • • ， are linearly independent—for curves in higher dimensional 
Euclidean spaces the special affine arclength involves more derivatives. The 
special affine curvature functions X\,X 2 ,X ^,... also involve higher derivatives 
than the corresponding curvature functions K \, a ： 2 , /C 3 ,... . This is just what we 
ought to expect — since the group SL(/i,M) is bigger than SO(«), we have to 
build more complicated things from our curve c before we can find something 
which is invariant under SL(«, M). 



CHAPTER 2 

WHAT THEY KNEW ABOUT 
SURFACES BEFORE GAUSS 


H aving traversed nearly the whole history of the study of curves, in one 
chapter, we now turn back to the beginnings of surface theory，and start 
the long journey toward the modern theory of higher dimensional manifolds. 
Even though the study of surfaces had begun long before the Serret-Frenet 
formulas appeared in 1847 , the theory of plane curves, at least, was well under¬ 
stood. 

The initial study of surfaces in K 3 began in a way that seems natural enough. 
Since we have a theory of curves in the plane, we may hope to describe surfaces 
by investigating the curves in which the surface intersects various planes. Indeed 
it turns out that we can describe the curvature of such curves in a surprisingly 
nice and precise way. 

The first results along this line, due to Euler, date from 1760 . Through a 
point /? on a surface M C M 3 we construct the line / which is perpendicular 
to M p . For each unit vector X e M p we can then consider the plane through p 



which contains both X and /. The intersection of this plane and M is the 
image of a curve cx with cj^(O) = p; we will also suppose c parameterized by 
arclength, so that 0 ^( 0 ) = A". We orient all these planes through / by choosing 
a vector v p perpendicular to M p and orienting the plane through X and / so 
that X, v p is positively oriented. Then cx has a signed curvature at 0 , which 
will be denoted by Kx- 

When we replace X by —X, the curve C-x is just cx traversed in the op¬ 
posite direction. Since the plane through X and v p now receives the opposite 
orientation, the curvature k—x equals the curvature kx- Thus ，X \-^ kx may 
be thought of as a function of directions in M p . Euler discovered a striking fact 
about the curvatures in these different directions: 
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Chapter 2 


1. THEOREM (EULER). If the Kx are not all equal, then there is precisely 
one direction, represented by a unit vector X \, say, in which kx has a minimum 
value k\ = kx x , and one in which it has a maximum value k 2 = kx? These two 
directions are perpendicular, and if X makes an angle of 9 with X\, then 


kx = k\ cos 0+^2 sin 9. 

(Notice that we can just as well write Kx = kj cos 2 </>-]- k\ sin 2 0, where 0 is the 
angle which X makes with since <f> = n/2 — 0, and consequently cos 2 9 — 

sin 2 <t> and sin 2 0 = cos 2 0. There is also a certain ambiguity in the statement of 
Euler’s theorem which does not affect the final result: If we change the vector v p 
so that it points in the opposite direction, then all curvatures Kx are changed 
to their negatives.) 


I do not know Euler’s proof of his theorem, and for reasons that will appear 
later, I am sure that the proof to be presented here is much simpler than the 
original. Nevertheless, it is sufficiently classical in spirit to serve as an historical 
substitute for Euler’s. 


PROOF. We begin by choosing our coordinate system so that p = (0,0,0) 
and so that the tangent plane at p is the (x, y)-plane, which means that in a 
neighborhood of p the surface M is {(x, y,z) : z = f (x, >^)}, where 


/( 0 , 0)=0 


夢(0,0) 

ox 


3 / 




a 少 


o 


(0,0) = 0. 



We now maintain that by rotating the (u)-plane we can arrange to have 

(0,0) = 0. 

To see this, we first recall that rotation through an angle of 9 radians is the linear 

‘ c • n • i . (cos 0 — sin 

transtormation Rg with matrix I .. 

、 sinc^ cos 

by an angle of 0, then M becomes the graph of 



If we rotate the (.v, j)-plane 



= / o R—0 ， 

fe(x\y) = f (x cos 9 — y sin 汐 ， x sin^ + y cos6). 
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So 


dh 

d 2 fo 

dxdy 


(x,y) = D\f( , )(-sin0) + D 2 f( , )(cos0) 


(x, y) = D\\f( , )(—sin0 cos0) + £) 12 /( , ) [- sin 2 9 + cos 2 9] 
+ 乃 22 /( ， )(sin0cos0) 


/n t omn /va a\ , ^22/(0, 0) - Z) n /(0, 0). 

^^-(0,0) = (cos20)Z)i2/(O,O) H --- Sin 26. 

dxdy 2 

In order to have d 2 fo/dxdy(0,0) = 0, we just choose 9 so that 

2 /) 12 /( 0 , 0 ) 

tan20 = ^n/^-Wmo) 心州射 

0 = tt/ 4 if^„/(0,0) = Z) 22 /(0,0). 

Having made this choice of coordinates, we now look at the various planes 
containing /, i.e.，containing the z-axis. First, the (x ， z)-plane intersects the 
surface in the curve 


^(0 = /(L 0)). 


Its curvature at 0 is therefore 


kx 







P 


C\C 2 - C 2 C\ 


3V 

dx 2 


( 0 , 0 ). 


(ci 2 + C2 2 ) 3/2 

This result holds when we give the (x,z)-plane its usual orientation, which 
is the same as the orientation making (ei) p , v p positively oriented when we 
choose v p to be {e^) p . Similarly, the (y, z)-plane intersects the surface in the 
curve c(t) = (t, /(0, t)) with curvature 

d 2 f 

ki = w(o,o). 

Finally, the plane through the z-axis which makes an angle of 9 with the x-axis 
intersects the surface in the curve 



c(t) = (/, f(t cos0, t sin 0 ))， 
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with curvature 


K 


g2 f 32 f 

f(t cos 0, t sin0) = cos 2 9 ( 0 , 0 ) + sin 2 0 -^-^-( 0 , 0 ) 


dt 2 

k\ cos 2 9 -\-kj sin 2 0. 


dx 2 




This formula shows that k always lies between k\ and kj, the curvatures in two 
perpendicular directions, and thus proves the whole theorem♦ 

Notice that if d 2 f/dx 2 ， 3 2 //3 少 2 > 0, so that the surface lies above the (x ， y)~ 
plane near p, then our choice of v p makes kx^kj > 0 . In general, if a surface 
locally lies on one side of its tangent plane through /?, then the choice of v p 
as a vector pointing toward this side makes k\,k 2 > 0 . If our surface is the 
boundary of a convex set in M 3 , we must therefore choose the inward pointing 
normal to obtain positive curvatures. If k\ and kj are of different signs, there 



is generally no such way to distinguish a direction for v p . 

The other main result about curves on surfaces, due to Meusnier，came shortly 
afterwards, in 1776 . Meusnier completed Euler’s investigations by finding the 
curvatures of the curves obtained by intersecting the surface M with any plane 
through /? g M. If P is the plane through p which contains / and a unit vector 
X g M p , then any other plane which contains X can be described by giving 
the angle 0 which it makes with P. Let c be the intersection of P and M，with 
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c (0) = p and c parameterized by arclength, so that c’(0) = X， and let be 
the corresponding intersection of and M. Meusnier’s theorem describes the 
curvature of at 0 in terms of the curvature Kx of c at 0: 

K<p - COS 0 = Kx • 

For the case of the unit sphere, pictured above, the curve c is a great circle, of 
radius 1， while is a circle of radius cos 0， so that = 1/ cos0. Naturally, 
in Meusnier’s theorem we must restrict </) to be less than n/2. At this angle, 
the plane is just the tangent plane, and does not generally intersect M in 
a curve at all; for 0 close to 7r/2, the plane intersects M in a curve of very 
large curvature (if Kx ^ 0). 

For Meusniers theorem we supply a proof which is decidedly non-classical 
in spirit, but which will be useful to have later on. We first define a function v 
in a neighborhood U C M of /? such that v{q) € M 3 is a unit vector and 
v{q) q € is perpendicular to M q for all ^ € C/. There are two choices for 
each v{q)\ in order to obtain a continuous function v: U ^ M, we orient C/, 
and then pick v(q) so that v(q), u, w is positively oriented for v q , w q positively 
oriented in M q . 



For any curve c in M with c(0) = /? we have 

(c’Q) ， ^(c^))) = 0 for all s. 

Differentiating this equation, we have 

心 ( 咖 )) \ 

ds S= J’ 

Now the vector dv(c(s))/ds \ s=0 , with components dv l (c{s))/ds \ s=() , depends 

on c’(0) = X; in fact, it equals X(v 2 ), X(v 3 )). To see this, just 

remember that to operate on a function /: M ^ M with a tangent vec¬ 
tor X p e M p , we can take any curve c with c’(0) = and then X p {f) — 
df {c(t))/dt\ t=0 . We can thus write 

(*) 〈 c 〃 (0) ， v(p)) = a{X) X = c’(0). 

Meusnier’s theorem follows directly from this equation: 


〈 c" ⑼, 


〆⑼， 
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2 . THEOREM (MEUSNIER). Let P be the plane through v(p) and X e M p , 
and let be the curvature of the curve on M cut out by a plane contain¬ 
ing X and making an angle of 0 with the plane P. Then 

K 小 . COS0 = KX. 

PROOF. First apply (*) to the curve c cut out by P. The second derivative c ff ( 0 ) 
of this curve is in P, and is also perpendicular to c ; (0) = X. Consequently, it 
is a multiple of v(p). Since P has been oriented so that X 7 v(p) is positively 
oriented, it follows that 0 /; ( 0 ) = kx - v (/?)， so 

a(I) = 〈 c" ⑼， ^(P)) = 

On the other hand，since 0^(0) = c r ( 0 ) = X 7 equation (*) also gives 

(1) kx — a(X) = (c^iO), v(p)). 

We can write C 0 "(O) = where is a unit vector which lies in and 

is perpendicular to X. Then makes an angle of 0 with v(p), which means 
that 

( 2 ) 〈 WO 》 =cos 0 . 

Combining equations ( 1 ) and (2)，we obtain 

— ( V ’( 0 )， 

=kH ' v 4>^ v (p)) 

=K 小 . COS 0 ♦ 

Despite the appealing simplicity of these results, there is something dissatis¬ 
fying about this whole approach of dissecting a surface into curves; we never 
seem to really get our hands on the surface itself. To do this, we must move 
forward 50 years in time. 



CHAPTER 3 

THE CURVATURE OF 
SURFACES IN SPACE 


A. HOW TO READ GAUSS 

T he single most important work in the history of differential geometry is 
Karl Friedrich Gauss’ paper, in Latin, of 1827: Disquisitiones generates circa 
superficies curvas. The following translation of (part of) this paper is basically 
the one published* by The Princeton University Library, 1902, except that it 
adheres even more closely to the notation and typographic disposition of the 
original. 

In addition，it has been supplemented with remarks designed to make this 
first confrontation with classical differential geometry much less painful. The 
translation of Gauss’ paper appears to the right — on odd-numbered pages — 
while corresponding remarks appear to the left. 

Although Part B of this chapter is an exposition of Gauss’ results, in modern 
notation, a preliminary reading of Gauss’ great work is heartily recommended; 
and since many of the difficulties will be clarified in Part B，as a general rule it 
is a good idea to read on, even if a particular section makes very little sense! 


* A reprinting was produced in 1965 by Raven Press, but this is also out of print. 
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REMARKS ON GAUSS 5 PAPER 


§1. Notice that (1) ，（2 ) ， (3) are used as the names of certain points [(1)= 
(1,0,0), etc.], a circumstance that is easy to forget later on. 



§2. This section gives a complicated proof, using spherical trigonometry, that 
the volume of the pyramid shown below is 



y 

/ 





and also includes remarks about the significance of the sign of the determinant. 
This result is equivalent to the well-known fact that | det A\ is the volume of the 
parallelepiped spanned by the rows of A. 

Almost all of this section can simply be skipped, except for noting that the 
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GENERAL INVESTIGATIONS 

OF 

CURVED SURFACES 


Investigations, in which the directions of various straight lines in space are 
to be considered, attain a high degree of clearness and simplicity if we employ, 
as an auxiliary, a sphere of radius — 1 described about an arbitrary center, and 
suppose the different points of the sphere to represent the directions of straight 
lines parallel to the radii ending at these points. As the position of every point 
in space is determined by three coordinates, that is to say, the distances of the 
point from three mutually perpendicular fixed planes, it is necessary to consider, 
first of all, the directions of the axes perpendicular to these planes. The points 
on the sphere, which represent these directions, we shall denote by (1), (2) ， (3). 
The distance of any one of these points from either of the other two will be a 
quadrant; and we shall suppose that the directions of the axes are those in which 
the corresponding coordinates increase. 

2 . 

It will be advantageous to bring together here some propositions which are 
frequently used in questions of this kind. 

I. The angle between two intersecting straight lines is measured by the 
arc between the points on the sphere which correspond to the directions of the 
lines. 

II. The orientation of any plane whatever can be represented by the great 
circle on the sphere, the plane of which is parallel to the given plane. 

III. The angle between two planes is equal to the spherical angle between 
the great circles representing them, and, consequently, is also measured by the 
arc intercepted between the poles of these great circles. And, in like manner, 
the angle of inclination of a straight line to a plane is measured by the arc drawn 
from the point which corresponds to the direction of the line, perpendicular to 
the great circle which represents the orientation of the plane. 

IV. Letting x, y, z; x\ z f denote the coordinates of two points, r the 
distance between them, and L the point on the sphere which represents the 
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expression cos (1)L, which will also appear later on, means the cosine of the 
angle 0 between the ray from (0,0,0) through the point L — (a,b,c) on the 



sphere and the ray from (0,0,0) through (1) = (1,0,0). Thus, for the usual 
inner product ( , ) on M 3 we have 

a = (L, (1)) = 1.1. cos0, 

so cos (1)L is the first component of L, and similarly for cos (2)L and cos (3)L. 

[The original contains cos (1)L 2 instead of cos 2 (1)L, etc., and multiplication 
is always indicated with a low dot . rather than a centered dot.] 
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direction of the line drawn from the first point to the second, we shall have 
x 1 = x-\-r cos (1)L, y r = y -\-r cos (2)L, 2 : 7 = 之 + r cos (3)L 

V. From this it follows at once that, generally, 

cos 2 (1)L + cos 2 (2)L + cos 2 (3)L = 1 


and also, if U denote any other point on the sphere, 

cos (1)L - cos ⑴ + cos (2)L - cos ⑵ Z/ + cos (3)L - cos ⑶ Z/ = cos LU 

VI. Theorem. If L, L\ L n , L m denote four points on the sphere, and A the 
angle which the arcs LL\L U L W make at their point of intersection，then we shall 
have 


cosLL” • cosZ/Z/" — cosLL”’ . cosZ/Z/’ = sinLIZ • sin d” • cos A 
Demonstration. Let A denote also the point of intersection itself, and set 

AL = t, AL f = V , AL r, — t ,f , AL W — t ,r, 

Then we shall have 

cos LL” = cosi cosi ,f +sini sin i 〃 cos A 
cosL’L/” = cosi'cosi"’+sin"sini"'cos A 
cos LL’” = cos i cos f 〃 + sint sin f 〃 cos A 
cosUL" — cost'cos t r, sin V sint" cos A 


and consequently, 

cosLL" • cos L f L ,f, — cos LI：” • cosL ， L” 

=cos A(cos t cos t ff sin V sin t ,n + cos V cos f sin i sin f 

— cos 亡 cos sinf sini" — cos V cosi" sint sin t ,f, ) 

=cos A(cosisini’ - sinicosi’)(cosi 〃 sint m - sini" cosf") 

=cos A - sin(i’ 一 i) • sin(i m — t ,r ) 

=cos A - sinLL' - sin L 〃 Z /〃 

But as there are for each great circle two branches going out from the point A, 
these two branches form at this point two angles whose sum is 180°. But our 
analysis shows that those branches are to be taken whose directions are in the 
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sense from the point LtoL\ and from the point L r, to L w \ and since great circles 
intersect in two points, it is clear that either of the two points can be chosen 
arbitrarily. Also, instead of the angle A, we can take the arc between the poles 
of the great circles of which the arcs LL\L”L”’ are parts. But it is evident that 
those poles are to be chosen which are similarly placed with respect to these 
arcs; that is to say, when we go from LtoL f and from L ,f to L m , both of the two 
poles are to be on the right, or both on the left. 


VII. LetL ， L r ， L” 

be three points on 

the sphere and set, for brevity, 

cos(l)L 

— X, 

cos(2)L : 

=y ， 

COS ⑶ Zi = 2 ： 

cos(l)L f 

_ /y» ’ 
tv ， 

cos (2)L f : 

= y ’， 

cos (3)L r 二 ％’ 

cos ⑴ _L 〃 

=x n , 

cos (2)L ,r : 

=y\ 

cos (3)L" 


and also 

xy f z r, + x’y”z + x”yz’ 一 xy”z’ 一 x’yz’’ 一 x r, y r z = A 

Let X denote the pole of the great circle of which LL r is a part, this pole being 
the one that is placed in the same position with respect to this arc as the point (1) 
is with respect to the arc (2)(3). Then we shall have, by the preceding theorem, 
yz f — y’z = cos(l)\ . sin (2)(3) • sinLL', or, because ⑵⑶ = 90°, 

yz’ 一 y’z = cos (l)\ - sin LL\ and similarly 
zx r — z f x = cos (2)\ - sin LL' 
xy r — x'y = cos(3)\ • sinLL r 

Multiplying these equations by x" ， y' z” respectively, and adding, we obtain, 
by means of the second of the theorems deduced in V, 

A = cos XX" • sin LL r 

Now there are three cases to be distinguished. First, when L” lies on the great 
circle of which the arc LL f is a part, we shall have \L n = 90°, and consequently, 
A = 0. If L ,f does not lie on that great circle, the second case will be when L f, is 
on the same side as \\ the third case when they are on opposite sides. In the last 
two cases the points L, L/ ， L” will form a spherical triangle, and in the second 
case these points will lie in the same order as the points (1), (2), (3), and in the 
opposite order in the third case. Denoting the angles of this triangle simply by 
L, L’ ， L” and the perpendicular drawn on the sphere from the point L n to the 
side LL/ by p, we shall have 

sin= sin L • sin LL" = sin L' • sin L’L ” ， and = 90 0 干 
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§3. This section merely defines (or tries to define) a differentiable surface, and 
its tangent plane at a point. 


§4. At a point A — (x, y, z) in the surface we have a unit normal vector va, and 
u G S 2 C M 3 is what Gauss calls L. The expression cos (1)L means the cosine of 



the angle between the rays from (0,0,0) through L and through (1) = (1,0,0) 
(c.f. page 58). So X, F, Z are just the components of L. Thus X, F, Z can be 
considered as functions on the surface [X(A) = first component of u, for a 
unit normal at A, etc.]. 

Gauss now nonchalantly introduces infinitely small quantities. The goal of his 
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the upper sign being taken for the second case, the lower for the third. From 
this it follows that 

±A = sinL.sinLL 、 sinLL 〃 = sin Z/.sin LZ/.sin L'L 〃 = sin L" .sin LL" .sin Z/L" 

Moreover, it is evident that the first case can be regarded as contained in the 
second or third, and it is easily seen that the expression 士 A represents six time 
the volume of the pyramid formed by the points L，U ， L” and the center of the 
sphere. Whence, finally, it is clear that the expression 士義 A expresses generally 
the volume of any pyramid contained between the origin of coordinates and the 
three points whose coordinates are x, y, z\ x\ y\ a：'; y” ， z". 

3. 

A curved surface is said to possess continuous curvature at one of its 
points A, if the directions of all the straight lines drawn from A to points of 
the surface at an infinitely small distance from A are deflected infinitely little 
from one and the same plane passing through A. This plane is said to touch 
the surface at the point A. If this condition is not satisfied for any point, the 
continuity of the curvature is here interrupted, as happens, for example, at the 
vertex of a cone. The following investigations will be restricted to such surfaces, 
or to such parts of surfaces, as have the continuity of their curvature nowhere 
interrupted. We shall only observe now that the methods used to determine 
the position of the tangent plane lose their meaning at singular points, in which 
the continuity of the curvature is interrupted, and must lead to indeterminate 
solutions. 

4. 

The orientation of the tangent plane is most conveniently studied by means 
of the direction of the straight line normal to the plane at the point A, which is 
also called the normal to the curved surface at the point A. We shall represent 
the direction of this normal by the point L on the auxiliary sphere, and we shall 
set 

cos (1)L = X, cos (2)L = Y, cos (3)L = Z 

and denote the coordinates of the point A by x y y, z. Also let x + dx, y + dy ， 
之 + d；s be the coordinates of another point A r on the curved surface; ds its 
distance from A, which is infinitely small; and finally, let X be the point on the 
sphere representing the direction of the element AA\ Then we shall have 


dx = ds . cos (1)\, dy = ds • cos (2)\, da ： = ds • cos (3)\ 
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initial manipulations is the equation 

X dx Y dy Z dz = 0. 

If x, y, z are considered as functions on the surface (that is, as the restriction 
to the surface of the standard coordinate functions on R 3 )，then this equation 
is literally true, interpreting dx^dy^dz as modern differentials. It should be 
easy to see this (remember how X, F, Z are defined). Also try to follow Gauss’ 
argument. 

The rest of section 4 gives formulas for F, Z in terms of different descrip¬ 
tions of the surface; in each case the formulas are paired with their negatives, 
since there are two different choices for the unit normal vector: 

(1) If the surface is {p e ffi. 3 : W(p) = 0}, for VF: E 3 ^ ]R, then 

P 

X = — , where P = D\W, Q = D 2 W, R = etc. 

[The original has XX + YY-^ZZ = 1, and PP^QQ^RR for P 2 +0 2 + i? 2 , 
and so forth, with a superscript 2 used only for the square of a term that is not 
a single letter.] 


(2) If the surface is the image of /: M 2 ^ M 3 [Gauss writes dx for d(x 。 /)= 
df' ， etc.] and 

a = D\f\ a f = D 2 f l 
b = D x f\ b f - D 2 f 2 
c = D x f\ c f - D 2 f 
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and, since \L must be equal to 90°, 


Zcos (1)\ + Fcos (2)X + Zcos (3)X = 0 


By combining these equations we obtain 

X dx + F + Z da ： = 0 

There are two general methods for defining the nature of a curved surface. 
Thefirst uses the equation between the coordinates x ， y ， z ， which we may suppose 
reduced to the form W — 0 f where W will be a function of the indeterminants x, 
y y z. Let the complete differential of the function W be 

dW = Pdx Qdy Rdz 

and on the curved surface we shall have 

PdxQdy Rdz = 0 

and consequently, 

Pcos (1)X + Qcos (2)K + 72 cos (3)X — 0 


Since this equation, as well as the one we have established above, must be true 
for the directions of all elements ds on the curved surface, we easily see that X, 
Y, Z must be proportional to P, Q, R respectively, and consequently, since 

X 2 -hY 2 + Z 2 = l 


we shall have either 


X 


P 


V(P 2 + Q 2 + i2 2 )， 


Y 


Q 




Z 


R 


W2 + Q2 + 况 2) 


or 


X 


P 


户 + Q 2+ 舻)’ 


Y 


-Q 


^ 2 + 0 2 + ^ 2 ) ? 


Z 


-R 


V(P 2 + Q 2 + R 2 ) 


The second method expresses the coordinates in the form of functions of two 
variables, p, q. Suppose that differentiation of these functions gives 


da: — adp-\-a r dq 
dy = b dp + b r dq 
dz = c dp + c r dq 
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then 


bd — cb f 

X = - , etc. 

A ， 


(3) If the surface is {(x, y,z) : z = f(x,y)} for /: M 2 ^ M 3 , then 

t 

X = t ， where t = D\ f, u — Dif, etc. 

Vl+， 2 +〆 

It should not be hard to work out these results, using our terminology. Again, 
it is instructive to follow Gauss’ derivations as well. 


§5. This section talks about orienting the surface，so that one can choose be¬ 
tween the two unit normals. 
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Substituting these values in the formula given above, we obtain 

{dX bY cZ) djo + {d r X b r Y c r Z) d(^ = 0 

Since this equation must hold independently of the values of the differentials dp, 
dq, we evidently shall have 

aX + bF + cZ = 0 ， a f X + = 0 

From this we see that X, Y, Z will be proportional to the quantities 

be’ 一 cb’ ， ca f — ac\ ab f — ba f 

Hence, on setting, for brevity, 

V / ((bc , — cb’) 2 + (ca' — ac') 2 + (ab r — ba , ) 2 ) = A 

we shall have either 

v be' ~cb' ca' —ac' „ ab r — ba’ 

A = —-—， 1 = ―-—， Zi = --- 

A A A 


v cb' - be' ac' - ca' „ ba r -ab' 

A = -， i ~ -， = - 

A J A J A 

With these two general methods is associated a third, in which one of the 
coordinates ， 之， say, is expressed in the form of a function of the other two, x, y. 
This method is evidently only a particular case either of the first method, or of 
the second. If we set 

dz — tdx -\-udy 

we shall have either 

y _ ^ 1_ 

V(1 + t 2, + W 2 )’ y/(l + t 2 + U 2 )’ y/(l + t 2 + U 2 ) 


X = ___ y — _^Z = 一丄 

V(i + P + w 2 )’ Vu + P + w 2 )’ V(i + P + " 2 ) 


The two solutions found in the preceding article evidently refer to opposite 
points of the sphere, or to opposite directions, as one would expect, since the 
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§6. In this section Gauss considers the map y, from the surface to S 2 , which 
takes A to the unit vector v which is normal to the surface at that point. The 
map v can be used to take any subset R of the surface to a subset y(i?) of S 2 . 
The area of y(i?) is referred to by Gauss as the total curvature of R. Then the 
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normal may be drawn toward either of the two sides of the curved surface. If 
we wish to distinguish between the two regions bordering upon the surface, and 
call one the exterior region and the other the interior region, we can then assign 
to each of the two normals its appropriate solution by aid of the theorem derived 
in Art. 2 (VII)， and at the same time establish a criterion for distinguishing the 
one region from the other. 

In the first method, such a criterion is to be drawn from the sign of the 
quantity W. Indeed, generally speaking, the curved surface divides those 
regions of space in which W keeps a positive value from those in which the 
value of W becomes negative. In fact, it is easily seen from this theorem that, 
if W takes a positive value toward the exterior region, and if the normal is 
supposed to be drawn outwardly, the first solution is to be taken. Moreover, it 
will be easy to decide in any case whether the same rule for the sign of W is to 
hold throughout the entire surface, or whether for different parts there will be 
different rules. As long as the coefficients P, Q, R have finite values and do not 
all vanish at the same time, the law of continuity will prevent any change. 

If we follow the second method, we can imagine two systems of curved lines 
on the curved surface, one system for which p is variable, q constant; the other 
for which q is variable, p constant. The respective positions of these lines with 
reference to the exterior region will decide which of the two solutions must be 
taken. In fact, whenever the three lines, namely, the branch of the line of the 
former system going out from the point A asp increases, the branch of the line of 
the latter system going out from the point Aas^ increases, and the normal drawn 
toward the exterior region, are similarly placed as the x, y, z axes respectively 
from the origin of abscissas (e.g., if, both for the former three lines and for the 
latter three, we can conceive the first directed to the left, the second to the 
right, and the third upward), the first solution is to be taken. But whenever the 
relative position of the three lines is opposite to the relative position of the x, y, 
z axes, the second solution will hold. 

In the third method, it is to be seen whether, when 2 : receives a positive 
increment, x and y remaining constant, the point crosses toward the exterior or 
the interior region. In the former case, for the normal drawn outward, the first 
solution holds; in the latter case, the second. 

6 . 

Just as each definite point on the curved surface is made to correspond to a 
definite point on the sphere, by the direction of the normal to the curved surface 
which is transferred to the surface of the sphere, so also any line whatever, or 
any figure whatever, on the latter will be represented by a corresponding line 
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curvature at a point A in the surface is defined as 

total curvature of R 
area of R 

where R is the “surface element 53 at A, which is supposed to have infinitely 
small area. As a first approximation to what Gauss is trying to say, we might 
define the curvature as 

total curvature of R 

lim --- 

area of R 

where the limit is taken as R approaches the point A. It is not a priori so clear 
whether this limit exists, or if it depends on the way in which R “approaches” 
Gauss also gives considerable discussion to the sign of the curvature. 
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or figure on the former. In the comparison of two figures correvsponding to one 
another in this way, one of which will be as the map of the other, two important 
points are to be considered, one when quantity alone is considered, the other 
when, disregarding quantitative relations, position alone is considered. 

The first of these important points will be the basis of some ideas which 
it seems judicious to introduce into the theory of curved surfaces. Thus, to 
each part of a curved surface inclosed within definite limits we assign a total or 
integral curvature, which is represented by the area of the figure on the sphere 
corresponding to it. From this integral curvature must be distinguished the 
somewhat more specific curvature which we shall call the measure of curvature. 
The latter refers to s. point of the surface, and shall denote the quotient obtained 
when the integral curvature of the surface element about a point is divided by 
the area of the element itself; and hence it denotes the ratio of the infinitely 
small areas which correspond to one another on the curved surface and on the 
sphere. The use of these innovations will be abundantly justified, as we hope, 
by what we shall explain below. As for the terminology, we have thought it 
especially desirable that all ambiguity be avoided. For this reason we have 
not thought it advantageous to follow strictly the analogy of the terminology 
commonly adopted (though not approved by all) in the theory of plane curves, 
according to which the measure of curvature should be called simply curvature, 
but the total curvature, the amplitude. But why not be free in the choice of words, 
provided they are not meaningless and not liable to a misleading interpretation? 

The position of a figure on the sphere can be either similar to the position 
of the corresponding figure on the curved surface, or opposite (inverse). The 
former is the case when two lines going out on the curved surface from the same 
point in different, but not opposite directions, are represented on the sphere by 
lines similarly placed, that is, when the map of the line to the right is also to 
the right; the latter is the case when the contrary holds. We shall distinguish 
these two cases by the positive or negative sign of the measure of curvature. 
But evidently this distinction can hold only when on each surface we choose a 
definite face on which we suppose the figure to lie. On the auxiliary sphere we 
shall use always the exterior face, that is, that turned away from the center; on 
the curved surface also there may be taken for the exterior face the one already 
considered, or rather that face from which the normal is supposed to be drawn. 
For, evidently, there is no change in regard to the similitude of the figures, if on 
the curved surface both the figure and the normal be transferred to the opposite 
side, so long as the image itself is represented on the same side of the sphere. 

The positive or negative sign, which we assign to the measure of curvature 
according to the position of the infinitely small figure, we extend also to the 
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§7. In this section Gauss finds a formula for the curvature k at A. His answer ， 
at the top of page 77, is given for a surface which is the graph of /: M 2 — M 3 . 
In this case, the functions X and Y can be thought of as functions on M 2 (that 
is, we consider X o f and Y o /)，and Gauss’ answer is 


k 


dx dY dx dY 


dx dy 3v 3 a* 


[= D x {Xo f)D 2 (Yof)- D 2 (Xo f)D x {Yof)l 


[The notation 語， etc., in the original has been preserved. Similarly, a few 
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integral curvature of a finite figure on the curved surface. However, if we wish 
to discuss the general case, some explanations will be necessary, which we can 
only touch here briefly. So long as the figure on the curved surface is such that to 
distinct points on itself there correspond distinct points on the sphere, the defi¬ 
nition needs no further explanation. But whenever this condition is not satisfied, 
it will be necessary to take into account twice or several times certain parts of 
the figure on the sphere. Whence for a similar, or inverse position, may arise an 
accumulation of areas, or the areas may partially or wholly destroy each other. 
In such a case, the simplest way is to suppose the curved surface divided into 
parts, such that each part, considered separately, satisfies the above condition; to 
assign to each of the parts its integral curvature, determining this magnitude by 
the area of the corresponding figure on the sphere, and the sign by the position 
of this figure; and, finally, to assign to the total figure the integral curvature 
arising from the addition of the integral curvatures which correspond to the 
single parts. So, generally, the integral curvature of a figure is equal to fk da, 
da denoting the element of area of the figure, and fc the measure of curvature at 
any point. The principal points concerning the geometric representation of this 
integral reduce to the following. To the perimeter of the figure on the curved 
surface (under the restriction of Art. 3) will correspond always a closed line on 
the sphere. If the latter nowhere intersect itself, it will divide the whole surface 
of the sphere into two parts, one of which will correspond to the figure on the 
curved surface; and its area (taken as positive or negative according as, with 
respect to its perimeter, its position is similar, or inverse, to the position of the 
figure on the curved surface) will represent the integral curvature of the figure 
on the curved surface. But whenever this line intersects itself once or several 
times, it will give a complicated figure, to which, however, it is possible to assign a 
definite area as legitimately as in the case of a figure without nodes; and this area, 
properly interpreted, will give always an exact value for the integral curvature. 
However, we must reserve for another occasion the more extended exposition 
of the theory of these figures viewed from this very general standpoint. 

7. 

We shall now find a formula which will express the measure of curvature 
for any point of a curved surface. Let da denote the area of an element of this 
surface; then Z da will be the area of the projection of this element on the plane 
of the coordinates x 7 y\ and consequently, if d2 is the area of the corresponding 
element on the sphere, ZdS will be the area of its projection on the same plane. 
The positive or negative sign of Z will，in fact, indicate that the position of the 
projection is similar or inverse to that of the projected element. Evidently these 
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lines later the expressions and stand for |^| and etc.] 

Gauss obtains this answer by considering an infinitesimal triangle with 
one vertex at (x, y, f(x, y)), one vertex at (x + dx, ydy, f(xdx, y -dy)), 
and one at (x + 8x, y + 8y, f(x + 8x, y + 5 少 )）• It is a challenge both to follow 
Gauss’ reasoning, and to put it in modern terms. Either way, one needs Gauss’ 
preliminary observation that 

area v(da) area of projection on (x,^)-plane of v(do) 
area do area of projection on (x, 少 ) -plane of da 

This mysterious equation really says that the tangent plane of A/ at >1 is parallel 
to the tangent plane of S 2 at v(A). If this hint does not help, simply accept the 
formula for k, which will be derived later, using modern terminology. 

The remainder of section 7 evaluates k in terms of partial derivatives of f 
(which Gauss denotes by z). 
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projections have the same ratio as to quantity and the same relation as to position 
as the elements themselves. Let us consider now a triangular element on the 
curved surface, and let us suppose that the coordinates of the three points which 
form its projection are 

%， y 

x + dx ， y + dy 
x + y + hy 

The double area of this triangle will be expressed by the formula 

dx•by — dy•bx 

and this will be in a positive or negative form according as the position of the 
side from the first point to the third, with respect to the side from the first point 
to the second, is similar or opposite to the position of the 汉 -axis of coordinates 
with respect to the x-axis of coordinates. 

In like manner, if the coordinates of the three points which form the projec¬ 
tion of the corresponding element on the sphere, from the center of the sphere 
as origin, are 

X ， Y 

X + dZ, F + dF 
Z + hX, Y-hbY 

the double area of this projection will be expressed by 

dX^bY-dY^X 

and the sign of this expression is determined in the same manner as above. 
Wherefore the measure of curvature at this point of the curved surface will be 

, dX^hY-dY^X 
axhy — ay -hx 

If now we suppose the nature of the curved surface to be defined according to 
the third method considered in Art. 4, X and Y will be in the form of functions 
of the quantities x, y. We shall have, therefore, 

dZ = (^)d：r + (^)d2/ 

cLr dy 

dY = (^)d；r + (^)d?/ 

SF — + {~)^y 

dx dy 
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When these values have been substituted, the above expression becomes 


k 

dX )( cir ( dX )( dF 

- 八 K dy A dx } 

Setting, 

as above, 

dz , ck 

~T~ — ~7~ ~ ^ 

dx dy 

and also 

dd2 ： _ 

dx 2 

rfH d d 21 y T d d 21 xt 

丄， 1 A — u ， — V 

clx-dy dy 2 

or 

dt = Tdx U dy, du = Udx i-Vdy 

we have from the formulae given above 


x = -tz ， 

Y = —uZy (1 + 卢 + u 2 )Z 2 = 1 

and hence 

dX = - 

Z d/^ — t diZ 


dY =- 

Z du — udZ 


(1 + 卢 - 

f- u 2 )dZ + Z(tdt - {-udu) — 0 

or 

dZ = 

: —Z^(tdt - {-ud.u) 


dX = 

—Z^{1 + u 2 )dt + Z^tudu 


dY = 

: - {~Z^t2idit — 么 3(1 + du 

and so 

dX _ 

= Z s (-(l+u 2 )T-htuU) 


dX _ 

= Z s (-(l+u 2 )U-htuV) 


dY 

d7 = 

= Z 3 (tuT-(l + t 2 )U) 


dY = Z s (tnU - a-^t z )V) 

Substituting these values in the above expression, it becomes 

k: 

， Z\TV-U^l^^uh-ZHTV-U ^)- 二二 
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§8. This section, except for the last theorem, was already done in Chapter 2. 
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8 . 

By a suitable choice of origin and axes of coordinates, we can easily make 
the values of the quantities t, u, U vanish for a definite point A. Indeed, the 
first two conditions will be fulfilled at once if the tangent plane at this point be 
taken for the x 汉 -plane. If ， further, the origin is placed at the point A itself, the 
expression for the coordinate z evidently takes the form 

z 二 + u°xy + + ^ 


where fl will be of higher degree than the second. Turning now the axes of x 
and y through an angle M such that 


tan 2 Af 二 


2L 7 ° 

yO_yO 


it is easily seen that there must result an equation of the form 

z= \Tx 2 ^r\Vy 2 


In this way the third condition is also satisfied. When this has been done, it is 
evident that 

I. If the curved surface be cut by a plane passing through the normal 
itself and through the x-axis，a plane curve will be obtained, the radius of 
curvature of which at the point A will be 二 j，the positive or negative sign 
indicating that the curve is concave or convex toward that region toward 
which the coordinates 之 are positive. 

II. In like manner — will be the radius of curvature at the point A of the 
plane curve which is the intersection of the surface and the plane through the 
汉 -axis and the 之 -axis. 

III. Setting x cos 9 , 汉 =r sirup, the equation becomes 

c = \{T cos 2 9 + V sin 2 cp)r 2 + fl 

from which we see that if the section is made by a plane through the normal at A 
and making an angle 9 with the x-axis，we shall have a plane curve whose radius 
of curvature at the point A will 

__ 1 _ 

T cos 2 屮 + y sin 2 ip 

IV. Therefore, whenever we have T = V, the radii of curvature in all the 
normal planes will be equal. But if T and V are not equal, it is evident that, since 
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§§9 ， 10, 11. These sections are essentially calculations, involving no new ideas. 
Every once in a while Gauss calculates a differential instead of some partial 
derivatives, but this should cause no difficulties. 

The goal is the very last ， four-line-long, equation at the end of section 11. 
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for any value whatever of the angle 9 , T cos 2 9 + V sin 2 9 falls between T and 
V, the radii of curvature in the principal sections considered in I and II refer to 
the extreme curvatures; that is to say, the one to the maximum curvature, the 
other to the minimum, if T and V have the same sign. On the other hand, one 
has the greatest convex curvature, the other the greatest concave curvature, 
if T and V have opposite signs. These conclusions contain almost all that the 
illustrious Euler was the first to prove on the curvature of curved surfaces. 

V. The measure of curvature at the point A on the curved surface takes 
the very simple form k = TV, whence we have the 

Theorem. The measure of curvature at any point whatever of the surface 
is equal to a fraction whose numerator is unity，and whose denominator is the 
product of the two extreme radii of curvature of the sections by normal planes. 

At the same time it is clear that the measure of curvature is positive for 
concavo-concave or convexo-convex surfaces (which distinction is not essential), 
but negative for concavo-convex surfaces. If the surface consists of parts of each 
kind, then on the lines separating the two kinds the measure of curvature ought 
to vanish. Later we shall make a detailed study of the nature of curved surfaces 
for which the measure of curvature everywhere vanishes. 

9. 

The general formula for the measure of curvature given at the end of Art. 7 
is the most simple of all, since it involves only five elements. We shall arrive at 
a more complicated formula, indeed, one involving nine elements, if we wish to 
use the first method of representing a curved surface. Keeping the notation of 
Art. 4, let us set also 

ddvr 
dx 2 
deny 
dy-dz 

so that 

dP = P r dx R ,f + Q" d^; 
dQ = R" dx + Q’ dy + P n d 2 ： 
dR = Q n dx + P n dy + R' dc; 

Now since t = —we find through differentiation 

R 

R 2 dt = -RdP + PdR = (PQ n -RP , )dx^ r (PP f, -RR ,, )dy^ r (PR , -RQ ,f )dz 



deny _ 
dy 2 

= Q ’， 

deny _ 

dx 2 

二 R’ 

p\ 

ddvr _ 
dxcU 

: Q "， 

ddW _ 
dx-di / 

二 R’ 
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As you can probably figure out for yourself，6 is an alternate form of p.] 
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or, eliminating dzby means of the equation Pdx Qdy Rdz = 0, 

R s dt = (- + 2PRQ” - P 2 R r )dx^ (PRP n + QRQ” — PQR f - R 2 R f, )dy 

In like manner we obtain 

R d du = (PRP ,r + QRQ” — PQR’ — R 2 R ,f )dx^(- /2 2 Q，+ 2QRP” - Q 2 R f )dy 
From this we conclude that 

R S T = -R 2 P ， -^2PRQ n - P 2 R f 
R 3 U = PRP” + QRQ n - PQR r - R z R n 
R S V = -R 2 Q r + 2QRP n - Q 2 R f 

Substituting these values in the formula of Art. 7, we obtain for the measure of 
curvature k the following symmetric expression: 

(P 2 + Q 2 + R 2 fk 

=P\Q ， R，- P f,2 ) + Q 2 (P , R , - Q" 2 ) + /2 2 (P , Q , - R ,f2 ) 

+ 2Qi?(Q 〃 i ?〃 一 P ， P”）+ 2PR(P”R” 一 Q ， Q” ） +2PQ(P ， r Q” 一 R ! R n ) 


10 . 

We obtain a still more complicated formula, indeed, one involving fifteen 
elements, if we follow the second general method of defining the nature of a 
curved surface. It is, however, very important that we develop this formula 
also. Retaining the notations of Art. 4, let us put also 


ddx 


AAx 

t 


= ot, 


- ot , 

dp 2 

Ap-Aq 

ddy _ 

= 6， 

_ _ 

= 6'， 

dp 2 

Apdq 

dcU _ 

: 1， 

dcU _ 

:1’， 

dp 2 

dp dg 


A Ax 
dq 2 


— a 


ft 




del: 

dq 2 



ft 


and let us put, for brevity, 


b& — cb r = A 
ca r — ac r = B 
ab r — ba f — C 
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A D 

First we see that Adx-]-Bdy~hC dz = O,ordz = —— dx —- dy; thus, inasmuch 
as 2 ; may be regarded as a function of x, y y we have 


Az 

dz 



A 

C 

B 

C 


Then from the formulae dx = adp + a r dq, dy = bdp + b’ dg 7 we have 


Cdp — b r dx — a r dy 
Cdq = —b dx -ha dy 


Thence we obtain for the total differentials of t, u 


C d dt = (A— 

dp 

C s du = (B~ 

dp 


dx — a f dy) + (C— — A—)(b dx — ady) 

dp 

C^-W dx - a f dy) + (C^ - B^)(b dx-ady) 

dp 


If now we substitute in these formulae 


dA 

dp 

dA 

扣 

dB 

dp 

AB 

如 

dC 

dp 

dC 

dg 


= c r 6 + by r — c6 r — b r y 
= c r 6 f + by ,r - c6 ,f - b , y , 
=a'y -f Col' — ay r — C r a 
=a r y f + cot" - ay” 一 c f a 
— b r a -f a6 f — ba f — a f 6 
= b’ct’ + a6 〃 一 bo" — a/6’ 


and if we note that the values of the differentials dt, du thus obtained must 
be equal, independently of the differentials d^, dy, to the quantities T d 尤 + 
U dy，U dx -h V dy respectively, we shall find, after some sufficiently obvious 
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transformations, 

C 3 T= aAb ,2 + 6Bb ,2 + yCb ,2 

- 2a f Abb r - 26, 編 ，— 2y , Cbb t 

+ a ,, A6 2 + 6 ,, B6 2 + 7 ， , C6 2 

C d U= -aAa f b f - 6Ba’b ，一、 Ca ， b’ 

+ a A{ab' + ba') + 6 , B(ab , + ba r ) + y'dab' + ba’) 
— a” Aab — 6” Bab — y r, Cab 
C^V = olAo/ 2 + 6Ba/2 + yCd r ^ 

— 2a r Aaa f — 26'Baa/ — 2y r Caa r 
+ a" Act^ 1 4 - 6 〃 5(x2 + y" {JcL^ 

Hence, if we put, for the sake of brevity, 


Act + B6 + Cy = D .(1) 

Aa ? + B6 r + Cy r = D f . ⑵ 

Aa" + B6" + CV 二 D” .(3) 


we shall have 

C 3 T 二 Db ,2 - 2Zm ， +ZT6 2 
(^U = -Da ， b’~\~ £T(ab’ + ba/) — D”ab 
C^V = Da ,2 — 2D r aa f + D”a 2 


From this we find, after the reckoning has been carried out, 

C 6 (TV - U 2 ) = (DD” - D ,2 )(ab f - ba f ) 2 = (DD” - D f2 )C 2 

and therefore the formula for the measure of curvature 

k _ DP"-D ' 2 
~ (A 2 + 5 2 + C 2 ) 2 

11 . 


By means of the formula just found we are going to establish another, which 
may be counted among the most productive theorems in the theory of curved 
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surfaces. Let us introduce the following notation: 

a 2 +6 2 +c 2 =E 

aa’ +66’ + cc' = F 

a ,2 + b ,2 + c ,2 = G 

aa +66 + cy = m .(4) 

aa f + b6 r + cy r = m f .(5) 

act" +66" + cy f, = m" . ⑹ 

a ’ol - \-b r 6 - \-c r y —n .(7) 

a/a ， +6 , 6 , +c ， 7 , ⑻ 

a ， ot" + 6,6 〃 +cY'= 柃 " . ⑼ 


A 2 +B 2 +C 2 = EG - F 2 = A 

Let us eliminate from the equations 1,4,7 the quantities 6 , 7 , which is done 
by multiplying them by be’ 一 cb\ b f C — c ， B，cB — bC respectively and adding: 
in this way we obtain 

(A(bc r — cb r ) + a(b r C — c r B) + a’ (cB — bC))a 
=D(b& — cb') + m(b’C — c f B) + n(cB — bC) 

an equation which is easily transformed into 

AD — aA + a(nF — mG) + a'(mF — nE) 

Likewise the elimination of a, 7 or a, 6 from the same equations gives 

BD = 6 A + b(nF — mG) + b’(mF — nE) 

CD = 7 A + c(nF — mG) + c’(mF — nE) 

Multiplying these three equations by ct〃， 6 〃， 7 〃 respectively and adding, we 
obtain 

DD ,f = (cm" + 66 " + 77 〃）A + m’’(nF — mG) + n”(mF — nE) . . . (10) 

If we treat the equations 2 ,5 , 8 in the same way, we obtain 

AD’ = ot’A + ain'F - m f G) + a’(m’F - n'E) 

BD' — 6 'A + b(n f F — m'G) + b’(m’F - n'E) 

CD 1 — y r A + c(n'F — m’G) + c’(m r F — n'E) 
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§12. If M, N are surfaces，then a development of M on is simply a map 
f' M — N which is an isometry (with respect to the induced Riemannian 
metrics). 
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and after these equations are multiplied by ot’ ， 6’ ， 7’ respectively, addition gives 

D ， 2 = (a ,2 +6 ,2 + 7 , 2 )厶 + m f 、 n，F — m’G) + n ， (m f F — n f E) 

A combination of this equation with equation ( 10 ) gives 

DD r, — = (cm" + 66" + 77^ — a r ^ — 6^ — 

+ E(n ,2 — nn n ) + F(nm” 一 2m’n’ + mn") + G(m’ 2 — mm ”） 


It is clear that we have 


— = 2m, = 2m\ — — m r + n, — = m f, + n\ — = 2n\ — = 2n ,r 

dp dq dp aq dp dq 

or 


m 


dE 

dp’ 


dF _ 1 dE 
dp 泛 dq’ 


m’ 

n’ 


1 dE 

_ ldG 

2 dp’ 


m” 

n” 


dF 

ldG 

2 



Moreover, it is easily shown that we shall have 

",, 〃 ,2 pil ,2 dn dn f dm” dm’ 

aa +00 +77 — a — 6 — y = - = - 

dg dp dp dq 

_ _i ddg ddF 1 ddG 

^ dq 2 dpdq ^ dp 2 


If we substitute these different expressions in the formula for the measure of 
curvature derived at the end of the preceding article, we obtain the following for¬ 
mula, which involves only the quantities E, F, G and their differential quotients 
of the first and second orders: 


A(EG- F 2 ) 2 k 


rpAE dG odF dG , AG, 2 , 

必 \ 7 ~ * 二- ^ j — r V—J ) 

aq aq ap aq ap 

, jpAE dG dE dG 0 dE dF . .dF dF Q dF dG, 
y dp dq dq dp dq dq dp dq dp dp J 

rdE dG_ dF + (—f) 

dp dp dp dq dq 


2 (EG- F 2 )(— - 2 


ddF . ddG 、 


dg 2 dp.dg dp 2 


12 . 


Since we always have 

dx 2 + dy 2 + dz 2 = E dp 2 + 2 F dp • + Gdq 2 
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it is clear that -J{E dp 2, + 2P dp • dg + G d^ 2 ) is the general expression for the 
linear element on the curved surface. The analysis developed in the preceding 
article thus shows us that for finding the measure of curvature there is no 
need of finite formulae, which express the coordinates x, y, z as functions of 
the indeterminants p, q\ but that the general expression for the magnitude of 
any linear element is sufficient. Let us proceed to some applications of this very 
important theorem. 

Suppose that our surface can be developed upon another surface, curved or 
plane, so that to each point of the former surface, determined by the coordinates 
x ， y ， z ， will correspond a definite point of the latter surface, whose coordinates 
are y r ， z r . Evidently y’ ， z’ can also be regarded as functions of the 
indeterminants p, q, and therefore for the element ^(dx ,2 + dy ,2 + dz f2 ) we 
shall have an expression of the form 

dp 2 + 2F f dp • + G f d^ 2 ) 

where E\ F\ G r also denote functions of p,q. But from the very notion of the 
development of one surface upon another it is clear that the elements corre¬ 
sponding to one another on the two surfaces are necessarily equal. Therefore 
we shall have identically 

E 二 E，，F 二 F，，G 二 G ， 


Thus the formula of the preceding article leads of itself to the remarkable 

Theorem. If a curved surface is developed upon any other surface whatever ， 
the measure of curvature in each point remains unchanged. 

Also it is evident that any finite part whatever of the curved surface will 
retain the same integral curvature after development upon another surface. 

Surfaces developable upon a plane constitute the particular case to which 
geometers have heretofore restricted their attention. Our theory shows at once 
that the measure of curvature at every point of such surfaces is equal to zero. 
Consequently, if the nature of these surfaces is defined according to the third 
method, we shall have at every point 


dd 之 

d«u 2 


dd2 ^ dck >2 
dy 2 dx-dy 



a criterion which, though indeed known a short time ago, has not, at least to our 
knowledge, commonly been demonstrated with as much rigor as is desirable. 


13. 

What we have explained in the preceding article is connected with a particu¬ 
lar method of studying surfaces, a very worthy method which maybe thoroughly 
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§14. Throughout this section Gauss uses x, y, z to denote x o c, y o c, z o c, for 
the curve c under consideration. The integral in the second display，involving 
both d and is what we would write as 


dL(a(u)) 

du 


d 





da x {u,t) 

~~ Ft ~~ 


2 




3a 1 (0, t) d 2 a x {^, t) 
du dudt 



dc x d 2 a l (0, t) 
dt dudt ~*~ 


dt. 


Thus, 


dc l • dx , 3 2 0^(0,0 3 2 0^(0,0 . d5x 


is 


and 


is 


A — ⑷]…… dudt dtdu — [dtdu]' 

show what this becomes after integration by parts. The integral is 


The next two lines 


,b da^ 

du 




dc x jdt 


+ 


攀•辱 


dt; 


here 


3a 1 

du 


(0,0 is 


8x 

Wr 
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developed by geometers. When a surface is regarded, not as the boundary of 
a solid, but as a flexible, though not extensible solid, one dimension of which is 
supposed to vanish, then the properties of the surface depend in part upon the 
form to which we can suppose it reduced, and in part are absolute and remain 
invariable, whatever may be the form into which the surface is bent. To these 
latter properties, the study of which opens to geometry a new and fertile field, 
belong the measure of curvature and the integral curvature, in the sense which 
we have given to these expressions. To these belong also the theory of shortest 
lines, and a great part of what we reserve to be treated later. From this point 
of view, a plane surface and a surface developable on a plane, e.g.，cylindrical 
surfaces, conical surfaces, etc., are to be regarded as essentially identical; and the 
generic method of defining in a general manner the nature of the surfaces thus 
considered is always based upon the formula y/(Edp 2 + 2Fdp • dg + Gdq 2 ), 
which connects the linear element with the two indeterminants p, q. But before 
following this study further, we must introduce the principles of the theory of 
shortest lines on a given curved surface. 


14. 


The nature of a curved line in space is generally given in such a way that 
the coordinates x, y, z corresponding to the different points of it are given in the 
form of functions of a single variable, which we shall call w. The length of such 
a line from an arbitrary initial point to the point whose coordinates are x, y, z y is 
expressed by the integral 


/d (3)、 (卽 


2 


dy 


2 


dz 


2 


du 


dw 


dw 


If we suppose that the position of the line undergoes an infinitely small variation, 
so that the coordinates of the different points receive the variations bx, hy, bz, 
the variation of the whole length becomes 

n dxdbx-\-dydhy + dzdhz 
>/(d.r 2 +dy 2 +d^ 2 ) 

which expression we can change into the form 

da ， *S.r+ dy. 8 夕 +dz .82 
>/(d .r 2 + d " 2 + d z 2 ) 

d- - - d - ‘) ~- + 5^-d ———— ^ + 5 之 • d 7 d " 2 --) 

>/(d.r 2 +d?/ 2 +dz 2 ) >/(da， 2 + d〆+dz 2 ) >/(d.t* 2 + di/ 2 + d 2 ： 2 ) 

We know that, in case the line is to be the shortest between its end points, all that 
stands under the integral vsign must vanish. Since the line must lie on the given 
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Notice that Gauss has given dL(a(u))/du\ u=Q for an arbitrary variation in 
3 -space, not just a variation through curves in the surface. His x = x o c is a. 
coordinate function of c in 3-space, not a coordinate function with respect to 
some coordinate system on the surface. If the surface is {p : W(p) = 0 } for 
some VF: M 3 ^ M, so that on the surface we have 


0 = dW = P dx + Q dy ^ Rdz 


P = D\W 7 Q = D 2 W, R = £) 3 W 7 


then for variations a through curves on the surface we will have 


9ofi 9of2 

d W(8x, 8yJz) = dW[^-(0j) 7 —(0 7 t),—(0j) 


du 


du 


du 


0, 


and any set of da 1 /du(0 7 t) with this property comes from some variation on 
the surface. Using this，Gauss deduces a necessary and sufficient condition for 
a curve y, parameterized by arclength，to be a geodesic on the surface. Unlike 
our equations for geodesics，this condition [the next-to-last displayed formula 
in this section] is expressed in terms of quantities which make sense only in M 3 : 

y U/ (t) y 2// (0 y 3// (0 

X(y(t)) = Y(y(t)) = Z_) ’ 

i.e .， y n {t) is a multiple of the normal vector at y{t). It takes a little detective 
work to see that Gauss is really considering a curved parameterized by arclength. 
Try to prove Gauss’ result by modifying our proof of Euler’s equations. 


§15. The proof in this section is essentially our (first) proof of Gauss’ Lemma 
(1.9-12). There are two main differences. First, Gauss uses the condition of 
section 14 rather than our equations. Second，for a surface it is unnecessary to 
choose a curve v: R ^ M q and manufacture the variation a that occurs in the 
proof of Lemma 1.9-12. Instead, we just use 


a(r,<p) = point with “polar coordinates” (r, 0 ). 
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surface, whose nature is defined by the equation P + Qdy + Rdz = 0 , the 
variations by, bz also must satisfy the equation Pbx + Qby-\-Rhz = 0 , and 
from this it follows at once, according to well-known rules, that the differentials 

d— ^——， d ― d ——— 

y/(d^ -\~dy 2 + dz 2 ) y/(dx 2 + dy 2 -\-dz z ) >/(dx 2 +d 汉 2 + ck 2 ) 

must be proportional to the quantities P, Q, R respectively. Let dr be the 
element of the curved line; X the point on the sphere representing the direction 
of this element; L the point on the sphere representing the direction of the 
normal to the curved surface; finally, let iq, i be the coordinates of the point X, 
and X, Y, Z be those of the point L with reference to the center of the sphere. 
We shall then have 


dx = gdr，dy = i]dr，dz = Idr 

from which we see that the above differentials become d^, diq, d(. And since the 
quantities P, Q, R are proportional to X, Y, Z, the character of shortest lines is 
expressed by the equations 

cU _ h 

H = Y = ~z 

Moreover, it is easily seen that V(d^ 2 + diq 2 + d( 2 ) is equal to the small arc on 
the sphere which measures the angle between the directions of the tangents at 

the beginning and at the end of the element dr, and is thus = if p denotes 
the radius of curvature of the shortest line at this point; thus we shall have 

pd^ = Xdr, pdiq = F dr, = Z dr 


15 . 

Suppose that an infinite number of shortest lines go out from a given point 
A on the curved surface, and suppose that we distinguish these lines from one 
another by the angle that the first element of each of them makes with the first 
element of one of them which we take for the first. Let 屮 be that angle, or, more 
generally, a function of that angle, and r the length of such a shortest line from 
the point A to the point whose coordinates are x, y, z. Since to definite values of 
the variables r, 9 there correspond definite points of the surface, the coordinates 
x ， y，z can be regarded as function of r, 9. We shall retain for the notation 
L, ^ iq, Xj Y, Z the same meaning as in the preceding article, this notation 
referring to any point whatever on any one of the shortest lines. 
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All the shortest lines that are of the same length r will end on another line 
whose length, measured from an arbitrary initial point, we shall denote by v. 
Thus v can be regarded as a function of the indeterminants r, 9, and if y denotes 
the point on the sphere corresponding to the direction of the element dv, and 
also g’ ， t ]’， 〔 denote the coordinates of this point with reference to the center of 
the sphere, we shall have 


dx dv 

-- r" • - 


, dv 


dc _ dv 

dcp dtp 


From these equations and from the equations 


dx t 


(lZ y 


we have 


dx da? , dz dz 

dr d(p dr d^p dr d^p 




(19 (19 


Let S denote the first member of this equation, which will also be a function of 
r, 9. Differentiation of S with respect to r gives 


dS _ ddx dx ,ddy dy ddz dz . 
dr dcp dP d(p dP dtp 


d((^) 2 +(^U(^) 2 ) 

dr dr dr 

dip 


d? da? , d” 

dr d^p dr 


d?/ dz 丄 1 

^ + d ； *d^ + 2 


d(g 2 + ” 2 + ( 2 ) 

d^p 


But g 2 + ” 2 + 0 = 1， and therefore its differential = 0; and by the preceding 
article we have, if p denotes the radius of curvature of the line r, 

^_X dri _ F _ Z 

dr P J dr P J dr P 


Thus we have 


dS 

dr 


P 


( 双 + 1V + ZO 


dv 

dip 


P 


• cos LX/ 


dv 

d(p 


0 


since y evidently lies on the great circle whose pole is L. From this we see that 
S is independent of r, and is, therefore, a function of 9 alone. But for r = 0 we 


evidently have v = 0, consequently — 0, and S = 0 independently of 9. Thus, 

dip 

in general, we have necessarily S — 0 , and so cos XV = 0, i.e., — 90 °. From 

this follows the 
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Gauss also gives a “geometric” proof of the lemma, using infinitesimal trian¬ 
gles. Perhaps the easiest way to make this rigorous would be to use our second 
proof of Gauss’ Lemma. 


§16. This section states a generalization of Gauss’ Lemma, which has also been 
given in Problem 1.9-28. 


§17. In terms of a coordinate system (/?, q) on a surface, the Riemannian metric 
that it acquires as a subset of R 3 has the expression 

〈 ，)=E dp ^ dp F dp ^ dq F dq ^ dp G dq ^ dq. 


so that 


|| || = "E dp ， dp 2F dp . dq G dq ， dq. 
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Theorem. If on a curved surface an infinite number of shortest lines of equal 
length be drawn from the same initial point, the lines joining their extremities 
will be normal to each of the lines. 

We have thought it worth while to deduce this theorem from the fundamen¬ 
tal property of shortest lines; but the truth of the theorem can be made apparent 
without any calculation by means of the following reasoning. Let AB, AB r be 
two shortest lines of the same length including at A an infinitely small angle, 
and let us suppose that one of the angles made by the element BB f with the 
lines BA, B r A differs from a right angle by a finite quantity. Then, by the law 
of continuity, one will be greater and the other less than a right angle. Suppose 
the angle at B is equal to 90 ° — o>, and take on the line AB a point C, such that 
BC = BB r • cosec o>. Then, since the infinitely small triangle BB f C may be 
regarded as plane, we shall have CB f = BC • cos o>, and consequently 

AC + CB' = AC -h BC - cos o> = AB — BC - (1 — cos a>) = AB r — BC - (1 — cos a>), 

i.e., the path from Ato B f through the point C is shorter than the shortest line, 

Q.E.D. 


16 . 

With the theorem of the preceding article we associate another, which we 
state as follows: If on a curved surface we imagine any line whatever, from the 
different points of which are drawn at right angles and toward the same side an 
infinite number of shortest lines of the same length, the curve which joins their 
other extremities will cut each of the lines at right angles. For the demonstration 
of this theorem no change need be made in the preceding analysis, except that 
must denote the length of the given curve measured from an arbitrary point; 
or rather, a function of this length. Thus all of the reasoning will hold here also, 
with this modification, that S = 0 for r = 0 is now implied in the hypothesis 
itself. Moreover, this theorem is more general than the preceding one, for we 
can regard it as including the first one if we take for the given line the infinitely 
small circle described about the center A. Finally, we may say that here also 
geometric considerations may take the place of the analysis, which, however, 
we shall not take the time to consider here, since they are sufficiently obvious. 

17 . 

We return to the formula ^/{E dp 2 -\- 2 F dp ■ dq G dq 2 )， which expresses 
generally the magnitude of a linear element on the curved surface, and inves¬ 
tigate, first of all，the geometric meaning of the coefficients 芯， F ， G. We have 
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Gauss uses a) to denote the angle between d/dp and d/dq (thus, w is a function 
on the surface). Gauss’ formula for cos w should be clear. Gauss also mentions 
that 

dV = y/EG - F 2 dp A dq, 

a special case of the formula on pg. 1.311. 

To interpret the last two formulas in this section, we must divide ds, dp, and 
dq by dt in all places; it is to be understood that dp/dt = (poc)’(t), etc., where c 
is the curve we are considering. It is simplest to assume that c is parameterized 
by arclength, so that the terms ds/dt are 1. If 


0 ( 5 ) = angle between c’C?) and 


then 




c(s) 




cos 9 




dp(c(s)) dq{c{s)) 
tL -:- h r - 


ds 


ds 


^f~E 


since 


c 


dp{c{s)) 


ds dp + 


dq{c{s)) 


ds dq 

Moreover, the area of the parallelogram spanned by d and 3/3/? is 

d 


sm 




and also 


dV 




， 〆 ) ， 


from which we obtain 


yjEG - F 2 


sin 9 


dq{c{s)) 

ds 


^/E 


§18. In this section Gauss deduces the conditions for a curve y (having the 
component functions y l = p o y, y 2 = g o y) to be 2 l critical point for the length 
function. 

Unlike the condition in section 14, the result is expressed totally in terms of 
the Riemannian metric ( , } on the surface, and is essentially the condition for 
a geodesic that we obtained in Chapter 1.9. However, the derivation is different ， 
because the geodesic is assumed to satisfy q{y{t)) — y 2 {t) = t [“we regard p 
as a function of q"]. 

It is not necessary to actually follow the derivation. The really important 
point is simply the equation that constitutes the first line that appears in the 
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already said in Art. 5 that two systems of lines may be supposed to lie on the 

curved surface, p being variable, q constant along each of the lines of the one 

system; and q variable, p constant along each of the lines of the other system. 

Any point whatever on the surface can be regarded as the intersection of a line 

of the first system with a line of the second; and then the element of the first line 

adjacent to this point and corresponding to a variation dp will be 二 y/E - dp, 

and the element of the second line corresponding to the variation dg will be 

= y/G-dq. Finally, denoting by o> the angle between these elements, it is easily 

seen that we shall have cosco = ―, — • Furthermore, the area of the surface 

y/EG 

element in the form of a parallelogram between the two lines of the first system, 
to which correspond q,q + dq, and the two lines of the second system, to which 
correspond p,p dp, will be y/(EG — F 2 ) dp - dq. 

Any line whatever on the curved surface belonging to neither of the two 
systems is determined when p and q are supposed to be functions of a new 
variable, or one of them is supposed to be a function of the other. Let s be 
the length of such a curve, measured from an arbitrary initial point, and in 
either direction chosen as positive. Let 0 denote the angle which the element 
ds = ^/(E dp 2 + 2Fdp • dg + Gdq 2 ) makes with the line of the first system 
drawn through the initial point of the element, and, in order that no ambiguity 
may arise, let us suppose that this angle is measured from that branch of the 
first line on which the values of p increase, and is taken as positive toward that 
side toward which the values of q increase. These conventions being made, it is 
easily seen that 


cos 0 • ds = y/E - dp + -JG - cos co • dg = 


sin 0 • ds = a/G • sin a) • dg 


^(EG~F 2 )-dq 

7e 


18 . 

We shall now investigate the condition that this line be a shortest line. Since 
its length s is expressed by the integral 

s = f ^/iEdp 2, + 2Fdp • dg + Gdq 2 ) 

the condition for a minimum requires that the variation of this integral arising 
from an infinitely small change in the position become 二 0 . The calculation, for 
our purpose, is more simply made in this case, if we regard p as a function of q. 
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second large display on page 105 (after the words “Thus we have”). For a curve 
parameterized by arclength, this equation says that 


dE 

3 /? 


(c(s)) 


( 


dc l 




2 


3F dc x dc 1 dG 
+ 2^-(c(^)) ——+ — 

as as op 

d 

— ? —— 
ds 


ds ) dp 


(c(s)) 


/ dc 2 
V ds 


) 


E(c(s)) 


dc 1 

ds 


F(c(s)) 


dc 2 

ds 


It is a very useful exercise to write out the equations on pg. 1.329 for the case 
of a 2-dimensional manifold, with — E, g \2 — F, g 2 i = G, and show that 
the first of these equations (the equation for A: = 1) yields the above equation (it 
will be necessary to perform the differentiation on the right side). 

Although Gauss performs various further manipulations, it is only necessary 
to follow the next step, 

d f ^dc l dc 2 ~\ d ( —八 

2— E — — |- F —r— = 2—\E cos0, 
ds ds ds ds 

■ 雨 

where 6 is defined in the previous section. 


§19. In this section Gauss rewrites formulas from preceding sections for the case 
of a coordinate system (p,q) which is “orthogonal” ((d/dp, d/dq) = F = 0). 
The important case for us is the last he considers, in which the coordinates are 
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When this is done, if the variation is denoted by the characteristic 5, we have 


5s 


a r oh p dr 

:—— - dp 2 +- - dp - d <7 + —■ dq 2 ) hp + (2EdP + 2F do) d hp 

dp dp dp 

2 ds 


Uo 


LI 厂 


dp 


dp 


2 ds 


t dp + Faq 


and we know that what is included under the integral sign must vanish indepen¬ 
dently of bp. Thus we have 


罢•如 2 


2 dF 

dp 


• dp • dg + ^ • dq 2 = 2ds • d • E ^ v + F d ? 
dp ds 

= 2ds • d - y/E - cos0 = ds，d ^ cose — 2ds - d0 - \/E - sin0 


= - 2 ^/{EG - F 2 ) .dg.de 

E 

= Edp + Fd_ q) .(M.dp + ^.dg)- 2 ^(EG - F 2 ) -dg.de 

E dp ^ dq ] 


This gives the following conditional equation for a shortest line: 


V(^G-F 2 ).de = 

^ E dp 


i . i F dE i , i dE i dF i l dG i 

d?)+ 2^*^* d 9 + 2*^- d P-^* d P-2*^* d 9 


which can also be written 

y/(JEG — F 2 ) • de = I . g . 狀 + I . ¥ • dp — f • dp - I • # • dg 

△ E ^dq dp ^ dp 

From this equation, by means of the equation 

E dp f 

cote = —^~-•/ + ^ - 

V(EG-F 2 ) d(? ^(EG-F 2 ) 

it is also possible to eliminate the angle 0, and to derive a differential equation 
of the second order between p and g, which, however, would become more 
complicated and less useful for applications than the preceding. 


19. 

The general formulae, which we have derived in Arts. 11,18 for the measure 
of curvature and the variation in the direction of a shortest line, become much 
simpler if the quantities p, q are so chosen that the lines of the first system cut 
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the “polar coordinates” （ r ， 0) defined in terms of the geodesics emanating from 
a point A of the surface. Here Gauss obtains the formula 

丄— 

4g ^ 5 

and 

dd dy/G d<p(c(s)) 

■i I 甲 ,, _ 

ds dr ds ’ 


where 6 is the angle the geodesic c makes with the lines 0 = constant. Notice 
that (r, <f>) is not a coordinate system on a whole neighborhood of A; we must 
delete one geodesic ray, including the point A itself. Consequently, and 
dy/G/dr are not even defined at A. Gauss’ final assertions in this section should 
be interpreted as saying that 


lim JG(B) = 0 

B~^A 


lim 


dy/~G 


dr 


(B) 




1 . 
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everywhere orthogonally the lines of the second system; i.e.，in such a way that 
we have generally co = 90°, or F = 0. Then the formula for the measure of 
curvature becomes 


4E 2 G 2 k = E . 


dE dG 
- • - 

dq dq 




dG、 2 . ^ dE dG . r,AE 2 


)+ G • — • — + G(—) - 2EG( 

dp dp dp dq 


dd^ ddG, 
dq 2 dp 2 


and for the variation of the angle 0 


JEG • de 


2 




dp - 


2 


dG 

dp 


dq 


Among the various cases in which we have this condition of orthogonality, 
the most important is that in which all the lines of one of the two systems, e.g .， 
the first, are shortest lines. Here for a constant value of q the angle 0 becomes 
= 0, and therefore the equation for the variation of 0 just given shows that we 


must have 


¥ 




0, or that the coefficient 芯 must be independent of q; i.e., E 


must be either a constant or a function of p alone. It will be simplest to take for 
p the length of each line of the first system, which length, when all the lines of 
the first system meet in a point，is to be measured from this point ， or，if there is 
no common intersection，from any line whatever of the second system. Having 
made these conventions, it is evident that p and g denote now the same quantities 
that were expressed in Arts. 15, 16 by r and (p，and that E = l. Thus the two 
preceding formulae become: 


4G 2 k = 


C ) 2 - 2 ㉝ 


• d0 = —i • ^ • dg 

^ dp 


or, setting ^/G = m, 


1 ddm 
m dp 2. 


d0 = 


dm 

dp 


- dq 


Generally speaking, m will be a function of p, q, and m dq the expression for 
the element of any line whatever of the second system. But in the particular 
case where all the lines p go out from the same point, evidently we must have 
m = 0 for p 二 0 . Furthermore, in the case under discussion we will take for q 
the angle itself which the first element of any line whatever of the first system 
makes with the element of any one of the lines chosen arbitrarily. Then, since for 
an infinitely small value of p the element of a line of the second system (which 
can be regarded as a circle described with radius p) is = p dq ， we shall have for 
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an infinitely small value of p, m = p y and consequently, for p = 0, m = 0 at the 

d TYL 

same time, and — = 1. 


We pause to investigate the case in which we suppose that p denotes in a 
general manner the length of the shortest line drawn from a fixed point A to 
any other point whatever of the surface, and q the angle that the first element 
of this line makes with the first element of another given shortest line going 
out from A. Let B be a definite point in the latter line, for which g = 0, and C 
another definite point of the surface, at which we denote the value of q simply 
by A. Let us suppose the points B, C joined by a shortest line, the parts of which, 
measured from B, we denote in a general way, as in Art. 18, by s; and, as in the 
same article, let us denote by 0 the angle which any element ds makes with the 
element dp; finally, let us denote by 0°, 0' the values of the angle 0 at the points 
B ， C. We have thus on the curved surface a triangle formed by shortest lines. 
The angles of this triangle at B and C we shall denote simply by the same letters, 
and B will be equal to 180° — 0, C to0 f itself. But, since it is easily seen from our 
analysis that all the angles are supposed to be expressed, not in degrees, but by 
numbers, in such a way that the angle 57 0 17’45〃，to which corresponds an arc 
equal to the radius, is taken for the unit, we must set 

0 ^ = 7T — B, = C 


where 2 tt denotes the circumference of the sphere. Let us now examine the 
integral curvature of this triangle, which is = / kda, da denoting a surface 
element of the triangle. Wherefore, since this element is expressed by mdp-dg, 
we must extend the integral ffmdp dq over the whole surface of the triangle. 

Let us begin by integration with respect to p, which, because k 1 




dm 


dp" 


gives do - (Const. - ), for the integral curvature of the area lying between 

dp 

the lines of the first system, to which correspond the values g，g + dg of the 
second indeterminate. Since this integral curvature must vanish for p = 0, the 
constant introduced by integration must be equal to the value of — for p = 0, 

i.e., equal to unity. Thus we have do(l —— where for — must be taken the 

dp ap 

value corresponding to the end of this area on the line CB. But on this line we 
have, by the preceding article, 觉 .dq = -d9, whence our expression is changed 

into dg + d0. Now by a second integration, taken from g — 0 to g = A, we find 
that the integral curvature = A + 0' — d° = A + B + C — tt. 
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The integral curvature is equal to the area of that part of the sphere which 
corresponds to the triangle, taken with the positive or negative sign according 
as the curved surface on which the triangle lies is concavo-concave or concavo- 
convex. For unit area will be taken the square whose side is equal to unity (the 
radius of the sphere), and then the whole surface of the sphere becomes = 4 tt. 
Thus the part of the surface of the sphere corresponding to the triangle is to the 
whole surface of the sphere as 士 (A + B + C — tt) is to 4 tt. This theorem, which, 
if we mistake not, ought to be counted among the most elegant in the theory of 
curved surfaces, may also be stated as follows: 

The excess over 180° of the sum of the angles of a triangle formed by shortest 
lines on a concavo-concave curved surface，or the deficit from 180° of the sum 
of the angles of a triangle formed by shortest lines on a concavo-convex curved 
surface, is measured by the area of the part of the sphere which corresponds, 
through the directions of the normals, to that triangle, if the whole surface of the 
sphere is set equal to 720 degrees. 

More generally, in any polygon whatever of n sides, each formed by a 
shortest line, the excess of the sum of the angles over (2?i — 4) right angles, 
or the deficit from (2n — 4) right angles (according to the nature of the curved 
surface), is equal to the area of the corresponding polygon on the sphere, if the 
whole surface of the sphere is set equal to 720 degrees. This follows at once from 
the preceding theorem by dividing the polygon into triangles. 
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B. GAUSS，THEORY OF SURFACES 

This part of the chapter presents Gauss’ results in modern dressing. It can 
be read completely independently of the first part，but there are frequent com¬ 
parisons with Gauss’ original paper. Some of Gauss’ notation will be changed; 
in particular，we will use K for Gauss 7 k. 

We consider a 2-dimensional submanifold M of M 3 , with /: M ^ M 3 the 
inclusion jnap. We also assume that M has been oriented. Since all our results 
will be local ones，we merely need an orientation in a neighborhood of each 
point, so this assumption does not place any real restriction on M. (However ， 
we must still investigate to what extent our results depend on the choice of the 
orientation.) 

At each point p e M there is a unique unit vector v(p) g E 3 such that 

(1) v(p) p e R 3 P is perpendicular to M p 

(2) v, w is positively oriented in E 3 whenever v p ， w p e M p is positively 
oriented. 

We thus have the normal map v : M ^ IR 3 , which actually goes to the unit 
sphere, v: M ^ S 2 C IR 3 . Notice that in his paper Gauss uses X, Y, Z for 



v 1 (p) 7 v 2 (p ) 7 v^(p). The idea of using this map may have been suggested to 
Gauss by astronomical practices，as he indicates in an abstract of the paper (also 
included in the Princeton University Library translation, and the Raven Press 
reprint). At any rate，the map v turns out to play such a crucial role that it is 
often called the Gauss map. 

An explicit formula for v: M ^ S 2 can be obtained from various explicit 
descriptions of M. For example，if M = {/? g M 3 : W(p) — 0} for some 
function VF: E 3 ^ E, then dW = 0 on M, i.e. ， ) — 0 for 3.11 q. 

This means that 

D\W{q) - v 1 + DiW{q) - v 2 + D^W{q) • u 3 = 0 

SW / ^ , dW 9 , for all v q e M q . 



The Curvature of Surfaces in Space 


113 


This equation can be written 

i(dW dW dW \ \ 

jy ⑻， , v) = 0 for all v q g M q . 

Consequently, 

.,(dW dW dW 

v(《)=normalized 1—(^), ~^*( 《 ) ， ^T( 分 ) 

which is precisely the formula Gauss gives. 

We can also find v in terms of a coordinate system x* To avoid confusion, we 
will denote the standard coordinate system in M 2 by (s, t) [Gauss uses (p,g)] 




and we will denote the inverse function / 一 1 : R 2 ^ M C M 3 by /• It is 
naturally necessary to consider the component functions of /, considered as a 
map into M 3 , in order to obtain a formula for v; we cannot obtain a formula 
for v totally in terms of x, since this coordinate system tells us nothing about 
the way M is situated in M 3 . Note that if ^ = f{s,t), then 


a 


3x 


q \ / q 




3s 


3s 


3s 


q 


a 


3x 


2 


q 




CM) 




q 


dt 


dt 


dt 


q 


Consequently, 


v(f ( 5 *, /)) = normalized cross product 

(df l 3/ 2 /a/^ a /2 a/ 3 \ 

V'a7 , "a7 , 17 ) V ^ H) 
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Thus we have 


df 2 df 3 df 2 df 


v(f(sj)) — ± 


ds dt 


dt ds 


A 


for 

△ = norm of the cross product, 

exactly as in Gauss. 

Finally, if M is the graph of g ： ]R 2 — M, so that 

M = {(x,y, g(x,y)) : x,y e M 2 }, 
then M is the image of /: M 2 ^ M 3 defined by 

/CM) = CM ， gCM)). 


It follows that 


S( x ^ J 7 )) — normalized cross product 




We are now ready for a preliminary, non-rigorous, definition of the curvature 
K(p) of M at p: 

area v(A) 

K(p) = ]im - —, 

area A 

where the limit is taken as the region A around p becomes smaller and smaller. 
There would be considerable difficulties involved in making this definition rig¬ 
orous. In the first place, we would have to prove that the limit exists. More 
crucial, the “area of v{AY needs some interpretation; in the figure below we 
want “area of v{A)^ to be negative, because v is orientation reversing near p. 



However, even with this non-rigorous definition we can find the curvature of 
certain surfaces. 





The Curvature of Surfaces in Space 


115 


Consider first the surface S 2 . The map v: S 2 ^ S 2 is just the identity, so at 


each point p e S 

K(p) — lim 



area v(A) 


lim 


area A 


A^p area A A^p area A 


For the case of the sphere S 2 (r) of radius r，we have 




which certainly seems reasonable. 

Next we consider a plane P. The function v : P — S 2 is constant, so 



K(p) 


. area v(A) 

=lim - —- 

A^p area A 


lim - 

A^p area A 



the plane does not curve. 
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Finally, we consider a cylinder Z. In this case, the function v: Z — S 2 is 
not constant, but the image of v always lies along a certain arc in S 2 , so for all 



points /? € Z we have 

火 ⑼ =lim ^ =lim = 0. 

area A area A 

The cylinder, too, does not curve! It begins to look as if we have the “wrong” 
definition of curvature; only later will Gauss Explain why this is the “right” 
definition. 

The manner in which we produce a rigorous definition of curvature is really 
very simple. Since M is a submanifold of E 3 , which has the usual Riemannian 
metric 〈，〉， we can give M the induced Riemannian metric / *( ， 〉. Together 
with the orientation which we have given M，this metric determines a 2-form 
‘WK，，on M，namely 


dV(g)(v q ,Wq) = signed area of the parallelogram spanned by v and w. 

On the sphere S 2 we also have a volume element, coming from its induced 
Riemannian metric and its usual orientation. As a glance at pg. 1.264 will 
show, this is just the 2-form which we have denoted by o f . 

We now define the Gaussian curvature K(p) of M at /? to be 


K(p) 


dV(p) 


in this equation, the division of 2-forms makes sense because any 2-form on 
the 2-dimensional manifold M is a multiple of the non-zero 2-form d V. If the 
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vectors v p ， w p G M p are orthonormal，then our definition says that 


f^{p) = ^{o f ){p){vp,w p ) 

= o f {v{p)){v^v p , v^Wp) for orthonormal v p ， w p 


This definition of K(p) involves v, and hence the orientation fi which we picked 
for a neighborhood of p. Choosing the opposite orientation —fi changes v to 
—v = Aov, where A : S 2 —> S 2 is the antipodal map, and consequently changes 

v\a f ) to v*(A*(a , )) = -v\o f ). 


On the other hand, dV \% also changed to —dV, so K(p) does not depend on 
the choice of orientation. 

Notice that if v is one-one in a neighborhood of /?, then for every region A 
contained in that neighborhood we have 


area v{A) = / a f 

J v(A) 

depending on whether v is 
orientation preserving or reversing. 


=J v*(cr’) 


Consequently, 


area ± 


area A 


dV 




We thus recover our original “definition”，provided that 




dV 




although this result seems reasonable, by continuity of v*{o f ) and dV, we will 
not worry about the exact manner in which A must approach p in order for 
the limit to work out. If v is not one-one in a neighborhood of /?， then we must 
have v^{p) = 0, so the rigorous definition gives K(p) = 0, which is more or less 
what one would expect from the intuitive definition. 

Although we will eventually obtain a neater expression for K, we begin by 
deriving Gauss’ first formula for using essentially the same reasoning as 
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Gauss uses. We first observe that the tangent plane M p is parallel to the tangent 
plane S 2 V ( P ) of S 2 at v(p). The reason is very simple: the tangent plane S 2 v ( p ) 



is perpendicular to v(p) 7 and so is M p , by the very definition of v(p). In the 
previous two sentences we have used the identification of R 3 p with R 3 ，the 


identification of M p with i^M p C M 3 P ， etc. Without further warning, we shall 
continue to do so, to avoid cluttering up the page with extra symbolism. 

If Vp ， w p e Mp are linearly independent，then 


K(p)= 


area of parallelogram P spanned by v 氺 v p ， v^w p 
area of parallelogram Q spanned by v p , w p 


Since M p is parallel to S 2 v ^ p ), this implies that 


K(p)= 


area of projection of P on (x, -plane 
area of projection of Q on (x, j)-plane 


(provided the denominator is not zero). In particular，suppose M is the graph 
of g: R 2 —^ M, and consequently the image of /: M 2 ^ M 3 defined by 


We choose 


/(M) = (M ， g(M)). 
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and consider v p , w p G M p ，for p = f (s, t). Then 


area of projection of Q on (x, -plane 

=area of parallelogram spanned by (1,0) and (0, 1) 
— 1. 


On the other hand. 




a/ 

ds 




P 


( 


a 

ds 


(p 。 /)* 

d(vof) 

ds 


3s 


) 


) 


3(i^ 。 /) 3(v 2 。 /) a(v 3 ◦ /) 


v (/ >) 


3s 


ds 


3s 


1 


(/>) 


(Here all partial derivatives are to be evaluated at the point (s, ， ). ） Similarly, 

a(〆。/) d(v 2 o f) d(v^of) 

I 

v(p) 




( 


dt 


dt 


dt 


) 


Consequently, 
K(f(sj)) = 


area of projection of P on (x, ^)-plane 
area of parallelogram spanned by 

ao 1 。/) 3 (p 2 o /)、 and /aw) aw) 


ds J ds j \ dt 
d(v x of) d(v 2 of) a(i^/)a(i^ 2 。/) 


dt 


3s 


dt 


dt 


ds 


This is precisely the formula Gauss obtains, at the top of page 77. If we use the 
formula 

( a^\ ( a^\ 

v(x,y\g(x,y ))= =normalized cross product I 1,0, — I x I 0, 1, — J , 


we obtain, after a little calculation, 


(*) 


K(x,y,g(x,y)) 


3 2 g ^ 2 g — ( ^ 2 g A 
3.y 2 dy 2 ^3x3^/ 


2 


1 + 


Gf) +( 


a 少 



2 
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We can compare this with the results of Chapter 2, in which we picked our 
coordinate system so that 

户=(0,0,0) = (0,0，发(0,0)) 

/( 0 , 0 ) = /( 0 , 0 ) = 0 
ox ay 

d 2 g 

^( 0 , 0 )- 0 . 

oxoy 

In this case, we obtained the result that the minimum k\ and maximum kj 
of all curvatures cut out by normal planes through p are the minimum and 
maximum of 3 2 ^/3x 2 (0,0) and d 2 g/dy 2 (0, 0). Now for our special choice of 
coordinates in M 3 , formula (*) becomes 

d 2 g d 2 g 

K{P) = d^d^^ kl，kl ' 

(Notice that, just as K does not depend on the orientation of M, neither does 
the product k\ - kj, even though k\ and kj individually do). We thus have the 
following result: 

The Gaussian curvature K(p) at any point p e M is the product of the 
extreme curvatures of the curves through p cut out by normal planes. 






— 0 


ki =| 士 1/fl 




cylinder of radius a 

K = k\ • k2 — ^ 



sphere of radius a 

K= ， k^k 2 =^ 

a 1 



To prove this result we have followed Gauss’ exposition. In particular, the 
proof of Euler’s Theorem which appeared in Chapter 2 is Gauss’. Undoubtedly, 
this proof is considerably simpler than Euler’s, for Gauss remarks with pride 
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“These conclusions contain almost all that the illustrious Euler was the first to 
prove on the curvature of curved surfaces.” Nevertheless, later developments 
provided a nicer way of obtaining these results, which will consequently now be 
rederived. 

The definition of curvature involves the map v: M ^ 5 2 C M 3 , but even 
more important, it involves the map : M p — S 2 V { P ). We can also think of v 
as an M 3 -valued function v : M M 3 , and we then have a map 

dv *. A/ p ~ 

namely 

dv{v p ) = {dv\v p ),dv 1 {v p ),dv' h {v p )) 

— iVp )? ))• 

We claim that Jv is essentially the same as to be precise ， 


~ dv(Vp^ v ^py 


Probably the easiest way* to see this is to take a curve c in M with 〆(() ） = v p . 
Then 

(Vp) 卜 C )’ ⑼ 

厂 6?(V 2 oC) 6?(V 3 oC) \ 

V dt /== o dt . t - 0 dt t~o/ v(p) 

— (〜( d )， 〜(口2)，〜(口3))1>(/;). 

* A more formal way is to introduce the inclusion map j \ 5 2 — > R 3 , and observe that 
we are really trying to prove that 

The result then follows from Problem 1.4-3, applied to the component functions of v. 
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Taking advantage of the identification of M 3 with M 3 ^, we can introduce one 
more confusion, and consider Jv as a map 

dv’.Adp — > . 

Since dv(v p ) is parallel to 〜(％) e 5* 2 心 >， and since is parallel to M p , 

we see that we actually have a map 


dv: M p — M p . 



Despite the tortuous process used to define this map, the net result can be 
described very simply; dv{v p ) is just v^{v p ) moved back up to a parallel vector 
in M p . 

The map dv : M p M p is sometimes called the Weingarten map. Using it, 
we can define a tensor II on M which is covariant of order 2: for v p ,w p e M p 
we define 


\\{p)(v p ,w p ) = - {dv{v p ),w p ). 

Notice that II does depend on the choice of v, and hence on the orientation 
picked for M. This tensor II is called the second fundamental form of M. But, 
as we shall soon see, it is not alternating, and hence not a 2-form. We can see the 
reason for the word “form” in the classical terminology by looking at another 
tensor that you may be worrying about. The first fundamental form I of M 
is just the induced Riemannian metric / *{ , ) on M (where { , ) is the usual 
Riemannian metric on M 3 ). Thus 


， 出夕） = {^ p -> ^ p ) [ = 〈 u ， 出〉] • 

Classically, one worked not with I and II but with the functions v I(u, v) and 
v \-^ II(u, u), which are “quadratic forms” in the components of v (compare 
Pg. 1-314). 

The second fundamental form provides us with a name for a quantity that 
appeared in Chapter 2: 
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0. PROPOSITION. Let c be a curve in M which is parameterized by arc- 
length. Let c(0) = p, and let X = 〆 ⑼ e M p . Then 

〈，⑼ , v(p)) = ll(X^X) [i.e., U(p)(X,X)l 

Consequently, ll(X,X) is the signed curvature kx of the curve cut out on M 
by the normal plane through v(p) and X (with X, v{p) positively oriented). 
Moreover, if is the curvature of the curve cut out by the plane which 
contains X and makes an angle of 0 with the normal plane, then 

Ktp - COS0 = KX. 

PR OOF. C learly 

( 1 ) 

Since 

〈(’⑴, * ⑴)〉 = 0 for all s, 

differentiation yields, using (1), 

= ~(X,dv(X)) 

= ll(X, X). 

This, of course, is precisely the result (proved in precisely the same way) which 
was derived in Chapter 2. The rest of the theorem is proved just as before. 

Unlike Meusnier’s Theorem, which involves II in a trivial way, Euler’s The¬ 
orem involves a crucial property of II: 

1. THEOREM. The second fundamental form II of M is symmetric, 

II(/0(A ， 4) =： ll(p)(Y p ,X p ) for Y p e M p . 

FIRST PROOF. Near p we can represent M as the image of a function /: M 2 ^ 
R 3 . Let N = v o /, so that N is considered as a function on M 2 ”. Then 



a ^(^(^)) 
ds 


dv(X) 


5=0 
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Now we clearly have 


N 




0. 


Differentiating with respect to s gives 

a 2 / \ IdN 


N, 


> 


3 /\ 


3x3，/ \ 3^ 

Exactly the same argument gives 

3 2 / 

/ \ I a r J 

⑻ 

Thus 


dv 


mMm- 


-Am- 


3/ 

3^ 

Since 3 / /ds, df/dt are a basis for the tangent space of M at each point, II 
is symmetric. For later use we also note that similar arguments lead to the 
equations 


(b) 


II 


3/ 

3^ 


I) 


N. 


( c ) 


av 

ds 2 

a 2 / 


n ( 昝，荽 )=1 〜 

Equations (a)—(c) are called the Weingarten equations. 

SECOND (COORDINATE FREE ， FANCY) PROOF. Let K be a vector field in a 
neighborhood U C M of /? whose value at p is Y p , and such that Y(q) e M q 
for all q e U. Since 

(v, K) = 0 on t/, 

we have 


0 = X p ({v,Y)) = {X p (v\Y p ) + (v{p\ X P (Y)) 

= (dv{X p \Y p ) + (v( P \X p {Y)), 

where X p {Y) denotes the vector whose / th component is X p {Y l ), for Y l the / th 
component of Y. This shows that 

ll{ P )(X p ,Y p ) = (v(p\X p (Y)). 

If a vector field X is picked similarly, then we have a corresponding equation, 
and it follows that 

ll(p){X p ,Y p ) - ll( P ){Y p , X p ) = (v(pl X P (Y)- Y P (X)). 
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It is easy to check that in R n we have X p {Y) — Y P {X) — [X, Y](p), so that 

II ⑻ (A> ， 4) - ll(p)(Y p ,X p ) = (v(p), [X, Y](p)). 

But the right side is 0, since [X, Y]{p) g M p . ❖ 

The symmetry of II states an important property of the map dv: M p M p ] 
this map is self-adjoint, 

{dv(X),Y) = {X,dv(Y)), X,Y e M p , 

Recall that if K is a vector space with an inner product ( , ), then a linear 
transformation T \ V ^ V called self-adjoint (with respect to ( , )) if 

{T v, w) = (u, T w) for all v,w eV. 

This is equivalent to saying that the matrix of T is symmetric with respect to 
any orthonormal basis. It is an elementary fact that eigenvectors and V 2 of T 
with distinct eigenvalues must be orthogonal, for if TVj = XjVj with k\ ^ 入 2 , 
then 

^•1(^1,1^2) = {Tv\,v 2 ) = (i；i, Tv 2 ) = A. 2 (^i, 1^2). 

The main theorem about self-adjoint transformations (the “spectral theorem”) 
states that a self-adjoint T : V ^ V has a basis of eigenvectors V\,... ,v n . We 
have seen that eigenvectors with distinct eigenvalues are orthogonal. If two or 
more eigenvectors have the same eigenvalue, then all vectors in the subspace 
they span are eigenvectors, so we can select an orthogonal collection spanning 
the subspace. If we also choose our eigenvectors to be of unit length, we thus 
obtain an orthonormal basis of eigenvalues. Applying this to dv: M p — M p 
we see that there is an orthonormal basis X \, Xj of M p with 

dv(Xi) = kiXi. 

This fact is what is behind 

2. THEOREM (EULER). The curvatures kx have a minimum k\ in one di¬ 
rection and a maximum kj in a perpendicular direction. For a direction X 
making an angle of 9 with the first direction we have 

kx = k\ cos 2 9 ki sin 2 9. 

PROOF. Let X\ and X 2 be unit eigenvectors of II. Then by Proposition 0 

KXi = 11(^/*, Xi) = - {dv{Xi), Xt) 

=—(A/ - Xi, Xt) 
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If we express any other unit vector X e M p as 

X — (cos0)Zi + (sin 0)Z 2 , 


then 


K X =ll(X,X) = -(dv(X\X) 

=-(A 1 (cos 0)^1 +A 2 (sin 9)X 2 , (cos9)X l +(smd)X 2 ) 
— —k\ cos 2 9 — 1.2 sin 2 9. 

As before, this completes the proof.* ♦ 


To connect the curvatures k\,k 2 with K, we first note that K can be expressed 
very succinctly in terms of dv. 


3. PROPOSITION. The Gaussian curvature K(p) at G M is 

K(p) = determinant of dv: M p — M p . 


PROOF. If Y\,Y 2 g M p are linearly independent, then 


K(p) = 


v*(a f )(p)(Y u Y 2 ) 

dV(p)(Y u Y 2 ) 


dV(p)(Y u Y 2 ) 


Using the fact that is parallel to M p ， and remembering that we are 

considering Jv as a map into M p , this can be written simply 


K(p) 


dV(p)(dv(Y x \dv(Y 2 )) 
^ dV(p)(Y u Y 2 )^ 


Since dV is a. 2-form, this ratio is indeed just detdv. 


* Notice that we have reduced Euler’s Theorem to a fact about the eigenvalues k\ S 乂 2 
of a self-adjoint transformation T \ V ^ V om. 2-dimensional vector space: 

入 1 = min (Tv, i>), 入 2 = max (T i>). 

IMI=i !MI=i 

For higher dimensions there is a minimax definition of the various eigenvalues. See 
Courant, Uber die Abhangigkeit .. . , Nachrichten, Koniglichen Gesselschaft der Wis- 
senschaften zu Gottingen, Math. Phys. Klasse 1919, pp. 255—264. 
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4. COROLLARY. Let Fi, F 2 e M p be orthonormal. Then 


^) = det(II(r / ,r；)) = det 


II ⑺， : Ii(r l 5 r 2 ) 
11(^2,^) ii(r 2 , Y 2 ) 


PROOF. Since II(y/, Yj) — (~dv(Yi),Yj), the matrix of dv with respect to 

ri,r 2 is 

—11 ⑺，: ro -ii(r 2 ,ro 
-11 ⑺， r 2 ) -ii(r 2 ,r 2 ) 

which has the same determinant as the matrix (II(F/, p)).♦:• 


( 


5. COROLLARY. Let Y\,Y 2 e M p be linearly independent. Then 


K{p) 


detCIIC^, Yj)) ^ det(II(^ ? Yj)) 
det((Yi,Yj)) det(I(^, Yj )) - 


PROOF. Corollary 4 proves the formula for orthonormal If Yi = 

ciij Xj , then replacing X \, X 2 by Y\, Y 2 multiplies both numerator and de¬ 
nominator by det(a/；)- ^ 

6. COROLLARY. For any /? g M we have 

— k\ • /c2» 

PROOF. Apply Corollary 4 when Y\^Y 2 G M p are the orthonormal basis of 
eigenvectors of dv, with eigenvalues —k\ ， —A ： 2 . Then 


K(p) = det 


-k\ 

0 


0 

-/C2 


k\ - ki. ❖ 


Corollary 5 allows us to develop an explicit formula for which involves 
some standard symbolism to be introduced first. If x = Cv,^) is a coordinate 
system on M, then we write the first fundamental form I as 

\ — E dx ® dx + F dx ® dy F dy ® dx G dy ® dy. 


If /: M 2 ^ E 3 is the inverse of y, so that 


3/373/I ‘ 


3 
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then 


OS I \ OS / 

t 1 


F = y^Jl 


亡(竽 丫. 

\ ot J 

t 1 


[Notice that the left sides of these equations really mean E[f (s ， tY), etc. (the 
functions E, F, G themselves are defined on M). This is just the form we want. 
For example，to compute dE/dx at g = f(s,i) we have 

= D\{E o x _ 1 )(x(^)) 
ox 

=IME 。 f)( X (q)) 
os 

the E on the last line denotes the same function on R 2 which appears in the 
equations (*)]• 

The symbols E, F, G were introduced by Gauss himself (at the beginning of 
section 11 of his paper)，and they have remained standard ever since. There are 
also standard symbols for the second fundamental form: 

ll = l dx dx m dx dy m dy dx n dy dy. 

To obtain formulas for /, m, n, we look at the Weingarten equations in the first 
proof of Theorem 1， and note that 


+ 

3 g2 ri 

十 j osot 

i — \ 

3 32 fi 


aV 

ds 2 


3 




d 2 f 

ds 2 
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formulas for v l {f{s,t)) have already been given on pages 113—114. Using Corol¬ 
lary 5, we now have the classical formula 


K(p) = 


ln _ m 2 … 
EG . 


The symbols /, m, « do not appear in Gauss, who instead uses the symbols 
D, D f ， D ff for certain quantities proportional to them. These symbols are 
introduced on page 87; it is easy to see that Gauss’ formula for K on this 
page is equivalent to the one we have just derived, although Gauss obtained it 
in a different way，by beginning with the formula which we derived on page 119. 

Our next theorem probably requires an apology in advance. The result looks 
amazingly unappetizing; it's hard to see why anyone would want it even if he 
had it, and the proof is merely an involved calculation. Nevertheless, we will 
justify its existence soon after proving it. The calculation appearing in the proof 
should be a lot easier to follow than Gauss，. 


7. THEOREM. Let (x, y) be a coordinate system on a neighborhood of ^ G 
M C M 3 , and let 

I — /*( , ) — E dx ^ dx F dx ^ dy F dy ^ dx G dy ^ dy. 


Then 


4{EG - F 2 ) 2 K = E 


dE dG dFdG 


dy dy dx dy 
F . dE dG dEdG 


( 


dG_ 

dx 



G 


dx dy 
dE dG 


dy dx 
dE dF 


dE dF dF dF dFdG 
dy dy 3^ ^ dx dx 


dx dx dx dy \ 3^ 


( 


dE 



-2(EG - F 2 ) 

PROOF. Consider the inverse /: M 2 
save space we will denote 


d 2 E d 2 F d 2 G 


3 少 


2 


dxdy dx 2 


M 3 of the coordinate system (x ， 少 ). To 


9 2 / 

3 ^ 3 ^ 


D\ 2 f simply by / 12 , 
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and similarly for other partial derivatives. We have 


K 


In — m 2 
EG — F 2 , 


where, by (**)， 




(fn^N) = (/n, 


m = (/12, N ) = ( /12, 


« = (/ 22 , N) = ( / 22 , 


/l X /2 
y/EG - F 1 

h 乂 h 

y/EG - F 2 

A x fi 

y!EG - F 1 


Thus 


2\2 


K(EG - F z ) 


〈 /ll，/l X / 2 〉 • (/22, f\ X fl) - 〈 /l2，/l x /2〉 2 

/n\ (f 2 i\ (fn\ (hi 

det I /i I • det I f\ | - det f\ | • det f\ 


fi 


h 


h 


fi 


In this equation each fi and fij is considered as a row of the matrix. If we use 
fi 1 and fi, to denote the columns with the same entries，then we also have 

fn\ (f\2 

K(EG - F 2 ) 2 = det\ A •det(/ 22 t ，/ 1 t ，/ 2 t )_det /i I • det^z 4 ,/i*,/ 2 4 ) 


fi 


fi 


det 



det 


〈 /ll ， /22 〉〈 /ll ， /l> (/ll ? fl) 

(fufn) E F 

{fi, hi) F G 

(/l2, f\l) (/l2, /l) (/l2, fl) 

det I (/ 12 ,/i) E F 

ifii.fi) F G 
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det 


〈 /ll,/22> - 〈 /l2,/l2> 〈 /ll,/l> 〈 /ll,/2> 


〈/l，/22> 

{fl, fll) 

0 

det I (/ 12 ,/i> 
(/l2, fl) 


E 

F 


F 

G 


〈 /l2 ， /l> (/l2, fl) 

E F 

F G 


But from the definitions of E, F, G in (*) we have 


{f u Jx) = \E x (E' = dE/ds) 
(/l2, f\) = jE 2 
(/22, fl) = \G2 
(/l2, fl) = \G\ 

(fu ， fi) = F'-\E 2 
{fn ， f\) = F 2 - \G\. 

Moreover, from the fourth and fifth equations we obtain 


2 


G n 


7^12 — 士五 22 


ds 


dt 


(/l2, fl) — (/l21, fl) + (/l2, fl\) 


〈 /ll ， /2> = (/ll2, fl) + 〈 /ll ， /22 〉 


Subtracting the first of these from the second, we then obtain 

〈 /11 ， fn) — 〈 /i2, /12〉 = —\G\\ + ^\2 — I 五 22. 


So we have, finally ， 


K(EG — F 2 ) 2 = det 


\G\\ + F\2 — \ \E\ F\ — jE2 

F 2 _ ^ (ji E F 


\Gi 


F 


G 


0 


2 


E 2 4Gi 


2 


— det I - 五 2 E 


2 


G\ F 


F 

G 


which gives the formula in the statement of the theorem! ^ 
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From Theorem 7 we can deduce an immediate Corollary, which appears on 
page 93 of the translation: 

“Thus the formula of the preceding article leads of itself to the remarkable 

THEOREM. If a curved surface is developed upon any other surface whatever, 
the measure of curvature in each point remains unchanged.” 

The Latin word for ‘remarkable’ has become part of the traditional name for 
this result: 

8. COROLLARY (THEOREMA EGREGIUM). If f 7 g: M ^ R 3 are two 
imbeddings (or even immersions) such that /*〈 ，〉 = g*〈 ，〉， then the Gauss¬ 
ian curvature of f(M) C M 3 at f(p) equals the Gaussian curvature of g(M) 
at g(p). 

The Theorema Egregium justifies the close attention which we have given 
to the Gaussian curvature of a surface. Although defined in terms of the 
imbedding of the surface in R 3 , it turns out to depend only on the Riemannian 
metric induced by that imbedding. This shows why the Gaussian curvature of 
a cylinder must be 0 — there is a (local) isometry from the plane to the cylinder: 
If the cylinder Z C M 3 is 

Z = {(x,y,z) : x 2 + 少 2 =a 2 }, 

then a local isometry /: M 2 ^ Z is given by 

f(s, t) = (a cos - , a sin - , A. 

V a a / 

It is easily checked that /* is an isometry at any point, but the result should be 
clear without any calculations whatsoever. To prove that Z is locally isometric 
to M 2 , just take a piece of paper，and roll it up into a cylinder. The map which 
takes a point on the flat piece of paper into the corresponding point on the 
rolled up piece of paper is an isometry. The isometric properties of this map 
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are expressed by the everyday experience that paper cannot be stretched, but 
merely bent. 



The Theorema Egregium is often expressed by saying that Gaussian curva¬ 
ture is a “bending invariant”. Anyone who has ever made a paper dunce hat 
knows (though he may not know that he knows) that the cone is also locally 
isometric to the plane, and hence has Gaussian curvature 0. 



Map makers, and anyone who has had to wrap a spherical object, know that 
a piece of paper cannot be bent onto even a small portion of a sphere. A math¬ 
ematical proof follows immediately from the Theorema Egregium, for a sphere 
has non-zero Gaussian curvature. The situation for surfaces is thus completely 
different from that for curves. All 1-dimensional Riemannian manifolds are lo¬ 
cally isometric to M 1 , for if we choose an immersed curve c in the manifold, 
then the arclength function of c is an isometry into M 1 • So there are no inter¬ 
esting bending invariants of a curve; all the interesting characteristics of a curve 
are invariants under the group of Euclidean motions. The Gaussian curvature 
is, of course, an invariant under the group of Euclidean motions, but it is also 
invariant under the much larger (but still important) group of maps which are 
merely defined on the surface, and are isometries there. (As a contrast, the 
mean curvature j(k\ + ki) is invariant under the group of Euclidean motions, 
but it is not a bending invariant; for example, the plane has mean curvature 0, 
while a cylinder of radius a has mean curvature a/2) 




134 


Chapter 3 ， Part B 


dE(a(u)) 
du M=0 



\ 

〆 ’(,)， 


using integration by parts. 


In the previous paragraph we appealed to the Theorema Egregium to prove 
that a sphere is not locally isometric to the plane. But it is also possible to 
give a much more elementary proof of this fact，that will convince some one 
who knows a little geometry. If there were an isometry from a neighborhood 
of p e S 2 into the plane, then a small triangle ABC around p, with portions 



of great circles as sides，would have to be mapped into an ordinary triangle 
A!B f C f on the plane, since great circles are geodesics on the sphere. The angles 
at A, 5，C would also have to equal the angles A f , B f ， C f • This is impossible, 
since /.A + ZB + ZC > 7r, while /.A f + ZB’ + AC f = it. This phenomenon turns 
out to have a generalization to arbitrary surfaces, a result that Gauss felt “ought 
to be counted among the most elegant in the theory of curved surfaces’’. In 
deriving this result we will essentially follow Gauss, but we will use some of our 
previous results about geodesics，and suppress some of the additional formulas 
which Gauss obtains along the way，so that the argument may appear somewhat 
simpler. 

In expounding the theory of geodesics，Gauss derives two conditions，of en¬ 
tirely different natures, for a geodesic on a surface. Although the first of these 
conditions is not necessary for our final goal, it is an interesting exercise in the 
calculus of variations, as well as an interesting result in its own right. We con¬ 
sider a curve c : [a^b] M and a variation a keeping endpoints fixed. Looking 
at the energy function (Gauss looks at length instead), we have 



This result holds for any variation a keeping endpoints fixed. The final integral 
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must therefore be zero whenever 3a/3w(0, t) e ⑴ for all since there is 
then a variation a through curves on M with these values of da/du(0,t). In other 



words, y is a geodesic on M if and only if 

f r](t)) dt = Q for every r] satisfying (rj(j) ， v(y(t))) = 0. 

Ja 

In particular, we can choose 

m = m[y ff if) - {y ff (t)Mym^y(t))] 

where 0 is a C°° function on [a, b] with 0 > 0 on (a, b) and </>(a) = ♦(!?)= 0. 
We then obtain 

rb 

o= / 0(O[(y // (O,/’0)> — 〈 〆’(>) ， ^(y(t))) 2 ]dt. 

Ja 

Since u(y(/)) has length 1, the Schwarz inequality (Theorem 1.9-1(2)) shows 
that the term in brackets is always < 0. Since (j>{t) > 0 on (a, b), the term in 
brackets must actually be 0 everywhere. This implies that y ff {t) and u()/(^)) are 
everywhere linearly dependent In other words, 

The curve y on M is a geodesic if and only if 
y f, is always perpendicular to M. 




Notice that this condition makes sense only for a surface in M 3 ; the vector y ff 
has no meaning for an abstract Riemannian manifold. 
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We now pass to the equations for a geodesic on any 2-dimensional Riemann- 
ian manifold. However, we will work with a special coordinate system. Consider 
a neighborhood oi p e M which is exp([/), where U C M p is a neighborhood 
of 0 on which exp is one-one. Identify M p with R 2 by choosing an orthonormal 
basis for M p . Introducing polar coordinates (p,0) on M p (minus some ray), we 
obtain a coordinate system (r,cp) = (p,0) o exp 一 1 on exp(C7) (minus some geo¬ 
desic ray), where p is now chosen so that p = 1 for vectors in M p with norm 1. 



This implies that ||3/3r|| = 1. We also know that 〈 3/3r ， 3/3 史 〉 = 0, by Gauss’ 
Lemma. So we have 

〈 ， 、 =dr ⑧ dr G d(p ⑧ dcp 

for some function G. The function G is just G(q) = (d/dcp\ q ， 3/3 史 | 《〉 . Clearly, G 
can be considered as defined on exp(f/) — {p}, even though any particular 
coordinate system (r, (p) can be defined only on exp(f/) minus some geodesic 
ray. In terms of the gij notation we have 
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The equations for a geodesic (pg. 1.329) thus give 


( 1 ) 


d 2 y x 1 dG 


ds 2 


2 dr 


( K ⑴） 



Now suppose y is some curve parameterized by arclength, and let 0(^) be 
the angle between y r {s) and 3/3r| y ⑴. Clearly 



( 2 ) 


cos^,) = (/(,), - 


dy x d 


y(^) 


d 

dy 2 d 

d 

dr 

y(s) ds d( P 

y ⑷ ’ 


y(^) 


dy x 

= ~di' 

For a geodesic y we obtain from (1) and (2) 


( 3 ) 


1 dG 


2 dr 


(y(s)) 



d cos^^) 
ds 

— sin 0(5 , ) 


d9(s) 

ds 


Finally, note that the area of the parallelogram spanned by the unit vectors y f {s) 
and 3/3r| y ( s ) equals 


(4) sin 0(5") and also equals dV 


dr 


Y f (s) 




\/G(y(s)) dr A dip 


dr 


dy x d 


L dy 2 d 

y(s ) 5 ds dr 

Y(s) 

h ds ^ 


y(s) 


^G(y(s)) 


dy 2 

ds 
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So we obtain 


1 dG 

2 dr 




- 烏⑼ 


dy 2 dO 
ds ds 


d9 

ds 


dG 


丄 dr 

27g 


(y(s)) 


dy 2 

~di' 


and thus 


(*) 


de 

ds 


aVc 

dr 


(y(s)) 


dy 2 

ds 


for any geodesic y. This is the equation Gauss finally obtains, on page 107 of 
the translation. 

Gauss also obtains the expression for K in this special coordinate system. 
Theorem 7 now takes the much simpler form 


4G 2 K = 



-2G 


d 2 G 


which gives 



It will now be necessary to obtain some further information about the func¬ 
tion y/G, for which Gauss gives very brief arguments. We will find it convenient 
to express G “as a function of p and 0 ”； that is, we consider 

g = G o exp o P - 1 ， 

where P : M p -ray ^ (0, 8] x (0, In) is P = (p, 0), so that g(po, 0o) isG(expu )， 
where v e M p has polar coordinates (po, 0o). At times it will also be convenient 
to use (p ， 0) to stand for a point in M p , as well as standing for the coordinate 
functions themselves. 

The function g can be considered as defined on (0, S] x [0, 2n] (with the 
same values at (p,0) as at (p, 2n)). What we want to examine is the behavior 
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of g near (0,0). We do this by comparing distances on M p with those on M. 
On the vector space M p we have an inner product ( , ) p \ since the tangent 
space {M p ) v of Mp at i; g M p can be identified with M p , we can use ( , )^ to 
obtain a Riemannian metric on M p . To keep things straight, we will use v, w 
to denote elements of M p , and X v , Y v to denote tangent vectors in the tangent 
space (M p ) v . For each X v e {M p ) v we thus have a certain norm, which we 
will denote by IIIA^HI. Now recall that exp*: (M p ) 0 M p is the identity map 
(when we identify (M p )q with M p )^ so that we certainly have 

lll^olll = ||exp,(^ 0 )||^ A^o g (M p ) 0 . 


If e > 0, it follows that for v e M p sufficiently close to 0 and X v e (M p ) v of 
unit norm III ||| we have 


I III nil - llexp^C^llexpCu) < 

so that for any Y v e {M p ) v we have 

lllhll 卜 Ikxpjmxp ⑻ I < f |||yj|. 
Noting that exp* (3/301 = 3/3p| exp ⑻， we have 

3 3 

—— < s • — 

d< P expCu) M v 



for all sufficiently small v, while clearly 




140 


Chapter 3, Part B 


Dividing all terms of (1) by p yields 




< A 


for all sufficiently small p. Since this is true for all e > 0, we have thus shown 
that 

(2) = P + o(p) 7 

where o(p) denotes a function on (0,5] x [0, 2n] such that 

lim — 0 (uniformly in 0). 
p->o p 

Clearly ^/g remains continuous on [0,5] x [0, 2ti] if we define 



V^(O,0)= 



Notice, moreover, that equation (2) now immediately implies that 



3y/g 


(0,0) 


(where 3^/g/dp(0 7 <j>) really denotes a right hand derivative). However, we want 
to know that 3^/^/3/> is actually continuous on [0,5] x [0, 2n]; the argument for 
this will require another step. 

From equation (**) we have, on (0,5] x [0, 2n] y 


[where exp(/>,0) really means 

(p ， 0) = ~Jg{p,4>) - A ： (exp(/>,0)) exp(u), where v e M p has 

polar coordinates (p ， 0 )]， 



which shows that 3 2 ^/^/3/> 2 (/>, 0) - > 0 as /> -> 0. It follows, in particular, that 
3 2 3/> 2 is bounded. For /> > 0 we have 


穿 (秦 ㉟ ， 0)— 


9V^ 

dp 2 


0) dt, 
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which immediately implies that 

lim —— (p ， 0) exists. 

p ^0 up 

By a standard theorem of calculus (see e.g” Spivak ， Calculus, 3 rd ed. ， pg. 200)，the 
limit must be d^/g/dp(0, 0), which equals 1 ， by (****)• It follows, in particular, 

po 

\p > q))ap = im 

o 


that 

f 氺氺氺氺氺 




2 


,po d 2 /? 

(p,<p)dp = lim / — - {p^)dp 


dp 2 


lim - / >o,0) 


►o dp 




(Po ， 0) 


We are ready to prove a theorem. 


9. THEOREM. Let A y C be three points of exp(f/)，where f/ is a convex 
neighborhood of 0 g Ma, on which exp is a diffeomorphism，and let a be a 
geodesic in exp(f/) between B and C. Denote the geodesic segment from A 
to C by and the geodesic segment from A to B by and let AABC be the 


“geodesic triangle” bounded by a ， 0 ， y. Also let ZA denote the angle between P 
and y, etc. Then 

C 



PROOF. Let 6,c g M p be the vectors with exp(6) = B and exp(c) = C. 
Choose the polar coordinates />, 0 on M p so that <j)(b) = 0. Then 0(c) is 
just ZA. 



c 
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It is easy to see that a cannot intersect the same geodesic ray through A 
twice, so it must be the image under exp of a curve in Ma which is the graph 
p = f (0) of some function in the polar coordinates p ， 0. Let 9(</>) be the angle 
between d/dr\ q and ot\q), where q is exp of the vector with polar coordinates 

(/W ， 0). 

We clearly have 


0(0) = 71 - IB, 9(ZA) = ZC. 


Consequently ， 



KdV = txp*(KdV) 

AABC J exp -1 (A ABC) 


-i(A^C) s/g OP 


r exp 

/ r f(4>) n2 



Vs 


F 0 vo 

C ( 


dp 2 


(p,0) dp d<t> 


9V| 


(f (♦)，♦)、 


by (**) 


by (****) 


=乂（ 1 + 而叫抑 by (*) 

=ZA + 0(ZA)-9(0) 

=AA + ^-C ZB — 7t. ♦♦♦ 


According to Theorem 9， on a surface with everywhere positive curvature 
the sum of the angles of a triangle with geodesic sides is always > 丌 ， while 
on a surface with everywhere negative curvature the sum of the angle is al¬ 
ways < 7r. This certainly looks like the case in pictures, and one can even see 
that the bigger the triangle, the bigger the difference between /.A + ZB + 
and 7r. 
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Notice that in the proof of Theorem 9 we are essentially converting an integral 
over the region AABC into an integral over (one of) its sides — this looks suspi¬ 
ciously like Stokes’ Theorem. In Volume III， we will indeed be able to present a 
much nicer proof of Theorem 9, which does not depend on a special coordinate 
system, and which makes explicit the role of Stokes’ Theorem. Moreover，we 
will be able to derive other important consequences of the same results. How¬ 
ever, for the moment, we are more interested in two questions. How did Gauss 
think of the Theorema Egregium? and What does it really mean? 

The paper which we have just examined was based on an earlier paper which 
Gauss did not publish (also included in the Princeton University Library trans¬ 
lation). From the earlier paper it appears that Gauss first proved the result in 
Theorem 9. Notice that this result gives the Theorema Egregium as a corollary, 
for it implies that 


K(p) = 


v ZA + ZB + ZC -7t 

lim - 

AABC^p area AABC 


and this limit is defined totally in terms of the Riemannian metric on the sur¬ 
face. After realizing this, Gauss probably said to himself: “Since 尺 depends 
only on the metric, it should be possible to show this by a direct computation; 
and if any one can do the computation, I certainly can.” 

To answer the second question — What does the Theorema Egregium really 
mean? — requires a more serious effort. The original definition of K seemed 
perfectly satisfactory — it has immediate geometric appeal and is even fairly easy 
to compute. The only defect of the definition is that the concept being defined 
turns out to be too good; it turns out to be invariant under isometries, while its 
definition is not. 

One of the dogmas of modern mathematics is that for any object that is invari¬ 
ant in any sort of way，a definition must be found that exhibits this invariance 
directly (even if such a definition is harder to understand than the original!). 
The definition of determinants is a good example. An elementary treatment of 
determinants usually begins by defining the determinant of a matrix, either by 
writing down a messy formula, or in an inductive way that really amounts to 
the messy formula. It is then shown that det(^4 - B) = det A - det B; from this 
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it follows that one can define the determinant of a linear transformation to be the 
determinant of its matrix with respect to any basis. This naturally leads one 
to seek a definition of the determinant of a linear transformation T \ V —> V 
which does not require a choice of basis. After one has defined it is 

possible to define det T to be the constant such that T* : ^ 

is det T times the identity. This definition is indeed independent of a choice 
of basis, but when one looks a little harder at it, one sees that it isn’t all that 
different from the messy formula defining the determinant of a matrix. Indeed, 
one proves that dim ^ 0 by writing down an explicit non-zero element 

of it (in terms of a basis!) which involves permutations in exactly the same way 
as the original definition of determinants. Finally, if one can tolerate even more 
complicated constructions, the “exterior algebra” of K can be used to produce a 
definition of det T which is completely independent of bases, and does not even 
mention permutations. In fact, the sign of a permutation a can then be defined 
as the determinant of the linear transformation T '.W 1 W 1 with T(e) = e a ^). 
This approach is expounded in Chevalley’s book, The Construction and Study of 
Certain Important Algebras. 

The definition of curvature follows a similar course. We first defined the cur¬ 
vature of a surface in a way which depends on an imbedding in M 3 ; our final 
goal is a definition of curvature which depends only on the Riemannian met¬ 
ric on the surface. It should be noted that we already have a candidate—the 
formula in the Theorema Egregium may be used as a definition of curvature! 
It must still be checked that this definition does not depend on the coordinate 
system (an uninviting task), but at least the definition involves only a coordinate 
system on the surface, not an imbedding of the surface in M 3 . Presumably, few 
mathematicians would accept this definition as a reasonable one. As we pro¬ 
ceed to frame more acceptable definitions of curvature, we will rely, just as in 
the case of determinants, on more complicated and abstract constructions, so it 
is important that we follow the historical evolution of the definition of curvature, 
in order not to lose sight of its geometric significance. 

One further comparison with the case of determinants will point out the di¬ 
rection which our investigations will have to take. It seems safe to assert that the 
modern invariant definitions of determinants would never have come into being 
if mathematicians had always considered only determinants of 2 x 2 matrices— 
our perception of the structure is complicated too much by the simplicity of this 
special case. Similarly for curvature. The essence of curvature is revealed more 
fully only when one transcends the limits of intuition and considers manifolds 
of arbitrary dimensions, conceived as existing in their own right, not as subsets 
of a Euclidean space. 
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ADDENDUM 

THE FORMULA OF 
BERTRAND AND PUISEUX; 
DIQUET 5 S FORMULA 


Consider the function ^/g which was used in the proof of Theorem 9. This 
function is defined on some [0, 8] x [0, 2n] and 


( 1 ) 


y/g(p,0) = ^/g{p,2it) for [0,5] 


( 2 ) 


^g(0,(j>) =0 


(3) 




3/> 


p—^o 3p 


We have also noted that on (0, 8] x [0, 2n] we have 


(4) 


9V| 

dp 2 


(p ， 0) = — v^(/> ， 0) ■ A ： (exp(/>,0)) 

= ■ 充 (P ， 0) ， say. 


This shows (appealing once again to the standard theorem of calculus used 
before) that 


(5) 


9Vf 

dp 2 


(0,0) = lim 

p ^0 


3 2 Vg 

dp 2 


(p,0) = 0. 


Differentiating equation (4) yields 


⑹ 


9V^ 

a/> 3 


(P ， 0)= 


Wg_ 

3/> 


(P ， 0) - 一 ^/g(p,<t>) 


dK 


(P ， 0). 


We note that the term dK/dp(p, (j>) makes sense even for p = 0; it is just a 
directional derivative of 尺 ， which is a C°° function on M p . Consequently, 
dK/dp(p, (p) approaches a limit as ^ 0. From (6) we thus obtain 
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( 7 ) 


9V| 

3/> 3 


(0,0) 


p^O op 5 


~K(p\ 


using (2) and (3), and noting that 

K(0^) = K(p). 


Using (2), (3), (5), (7), we now have the Taylor polynomial expansion 



[There is still a technical detail which must be taken care of. We actually want 
to know that the remainder 


= yfg{p,4>)~ P + 


6 ~~ 


satisfies 


lim 只 (?) 0 uniformly in <f>. 


p^o p D 

This can be seen from the proof that the Taylor polynomial approximates the 
function; the proof involves L’Hdpital’s Theorem, which in turn depends on 
the Cauchy mean value theorem [Calculus, pg. 201), whose role will be made 
explicit. We have 


lim 

p^O 


Vg(p, 0 ) - p-\- K(p)p 3 /6 


P : 


lim 


3/> 


(p^)-\ + K(p)p 2 /2 


3p 


2 


0 < p < />, by the 
Cauchy mean value theorem 


i2 




lim 

p^O 


dp 2 


+ k( p )5 


lim 

p^O 


d\/g 

3/> 3 


6p 


+ K(p) 


0< p< p 


6 


0 < p < p 


— 0, 

with the final equality coming from (7).] 
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10. PROPOSITION (BERTRAND AND PUISEUX ； 1848). Let C(p) be the 
circumference of the “geodesic circle” of radius p around p G M，consisting of 
the endpoints of geodesic segments of length p which start at p. Then 


PROOF. Clearly 


C(p) 


So 


K{p) = lim 3 


2np — C (p) 

7rp 3 


^2n 


^2n 


d(f 


P 


exp(/9,0) 

K(p)p 


^2n 

d<t> = / \fg{pA) d<t> 

T o 


2 n 


6 ) ^ + J 0 〆〆 ) 抑 


2n { p 


K(p)p : 


) 


+ °(P )• 




7tp' 


P' 


According to Proposition 10, on a surface of positive curvature, like a sphere ， 
geodesic circles are always “too small’’，while on surfaces of negative curvature, 
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they are always “too large”. Notice that Proposition 10 gives another interpre¬ 
tation of curvature totally in terms of the Riemannian metric on M. There is 
yet another formula of the same type. 


11. PROPOSITION (DIQUET ； 1848). Let A{p) be the area enclosed by the 
geodesic circle of radius p around p e M. Then 
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which easily yields the desired result. 



CHAPTER 4 

THE CURVATURE OF 
HIGHER DIMENSIONAL MANIFOLDS 


A. AN INAUGURAL LECTURE 


O n June 10 , 1854 the faculty of Gottingen University heard a lecture entitled 
Uber die Hypotheses welche der Geometrie zu Grunde liegen (On the Hypotheses 
which lie at the Foundations of Geometry). This lecture was delivered by Georg 
Friedrich Bernhard Riemann, who had been born just a year before Gauss’ 
paper of 1827 . Although the lecture was not published until 1866 ， the ideas 
contained within it proved to be the most influential in the entire history of 
differential geometry. To be sure, mathematicians had not neglected the study 
of surfaces in the meantime; in fact, Gauss’ work had inspired a tremendous 
amount of work along these lines. But the results obtained in those years can 
all be proved with much greater ease after we have followed the long series 
of developments initiated by the turning point in differential geometry which 
Riemann’s lecture provided. 

A short account of the life and character of Riemann can be found in the 
biography by Dedekind* which is included in Riemann’s collected works (pub¬ 
lished by Dover). His interest in many fields of mathematical physics, together 
with a demand for perfection in all he did，delayed until 1851 the submission 
of his doctoral dissertation Grundlagen fur eine allegemeine Theorie der Functionen einer 
veranderlichen complexen Grosse (Foundations for a general theory of functions of 
a complex variable). Gauss’ official report to the Philosophical Faculty of the 
University of Gottingen stated “The dissertation submitted by Herr Riemann 
offers convincing evidence of the author’s thorough and penetrating investiga¬ 
tions in those parts of the subject treated in the dissertation, of a creative, active 
truly mathematical mind, and of a gloriously fertile originality •” 

Riemann was now qualified to seek the position of Privatdocent (a lecturer 
who received no salary, but was merely forwarded fees paid by those students 


* Even for those who can only plod through German, this is preferable to the account 
in E.T. Bell’s Men of Mathematics^ which is hardly more than a translation of Dedekind, 
written in a racy style and interlarded with supercilious remarks of questionable taste. 
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who elected to attend his lectures). To attain this position he first had to sub¬ 
mit an “inaugural paper” (Habilitationsschrift). Again there were delays, and it 
was not until the end of 1853 that Riemann submitted the Habilitationsschrift, 
Uber die Darstellbarkdt drier Function durch due trigonometrische Reihe (On the repre- 
sentability of a function by a trigonometric series). Now Riemann still had to 
give a probationary inaugural lecture on a topic chosen by the faculty, from a 
list of three proposed by the candidate. The first two topics which Riemann 
submitted were ones on which he had already worked, and he had every reason 
to expect that one of these two would be picked; for the third topic he chose the 
foundations of geometry. Contrary to all traditions, Gauss passed over the first 
two, and picked instead the third, in which he had been interested for years. 
At this time Riemann was also investigating the connection between electric¬ 
ity, magnetism, light, and gravitation, in addition to acting as an assistant in 
a seminar on mathematical physics. The strain of carrying out another ma¬ 
jor investigation, aggravated perhaps by the hardships of poverty, brought on a 
temporary breakdown. However, Riemann soon recovered, disposed of some 
other work which had to be completed, and then finished his inaugural lecture 
in about seven more weeks. 

Riemann hoped to make his lecture intelligible even to those members of 
the faculty who knew little mathematics. Consequently, hardly any formulas 
appear and the analytic investigations are completely suppressed. Although 
Dedekind describes the lecture as a masterpiece of exposition, it is questionable 
how many of the faculty comprehended it. In making the following translation,* 
I was aided by the fact that I already had some idea what the mathematical 
results were supposed to be. The uninitiated reader will probably experience a 
great deal of difficulty merely understanding what Riemann is trying to say (the 
proofs of Riemann’s assertions are spread out over the next several chapters). 
We can be sure, however, that one member of the faculty appreciated Riemann’s 
work. Dedekind tells us that Gauss sat at the lecture “which surpassed all his 
expectations, in the greatest astonishment, and on the way back from the faculty 
meeting he spoke to Wilhelm Weber, with the greatest appreciation, and with an 
excitement rare for him. about the depth of the ideas presented by Riemann”. 


*The original is contained, of course, in Riemarm’s collected works. Two English 
translations are readily available, one in \ olume 2 of Smith’s Source Book in Mathematics 
(Dover), and one in Clifford’s Mathematical Papers (Chelsea). 
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On the Hypotheses which lie at 
The Foundations of Geometry 

Plan of the Investigation, 

As is well known, geometry presupposes the concept of space, as well as 
assuming the basic principles for constructions in space. It gives only nominal 
definitions of these things, while their essential specifications appear in the form 
of axioms. The relationship between these presuppositions is left in the dark; 
we do not see whether, or to what extent, any connection between them is 
necessary, or a priori whether any connection between them is even possible. 

From Euclid to Legendre，the most famous of the modern reformers of 
geometry, this darkness has been dispelled neither by the mathematicians nor by 
the philosophers who have concerned themselves with it. This is undoubtedly 
because the general concept of multiply extended quantities, which includes 
spatial quantities, remains completely unexplored. I have therefore first set 
myself the task of constructing the concept of a multiply extended quantity 
from general notions of quantity. It will be shown that a multiply extended 
quantity is susceptible of various metric relations, so that Space constitutes only 
a special case of a triply extended quantity. From this however it is a necessary 
consequence that the theorems of geometry cannot be deduced from general 
notions of quantity, but that those properties which distinguish Space from other 
conceivable triply extended quantities can only be deduced from experience. 
Thus arises the problem of seeking out the simplest data from which the metric 
relations of Space can be determined, a problem which by its very nature is not 
completely determined，for there may be several systems of simple data which 
suffice to determine the metric relations of Space; for the present purposes, 
the most important system is that laid down as a foundation of geometry by 
Euclid. These data are — like all data — not logically necessary, but only of 
empirical certainty, they are hypotheses; one can therefore investigate their 
likelihood, which is certainly very great within the bounds of observation, and 
afterwards decide upon the legitimacy of extending them beyond the bounds of 
observation, both in the direction of the immeasurably large，and in the direction 
of the immeasurably small. 

I. Concept of an n fold extended quantity. 

In proceeding to attempt the solution of the first of these problems, the 
development of the concept of multiply extended quantity, I feel particularly 
entitled to request an indulgent hearing, as I am little practiced in these tasks 
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of a philosophical nature where the difficulties lie more in the concepts than 
in the construction, and because I could not make use of any previous studies, 
except for some very brief hints on the subject which Privy Councilor Gauss has 
given in his second memoir on Biquadratic Residues, in the Gottingen Gelehrte 
Anzeige and in the Gottingen Jubilee-book, and some philosophical researches 
of Herbart. 


Notions of quantity are possible only when there already exists a general 
concept which admits particular instances. These instances form either a con¬ 
tinuous or a discrete manifold, depending on whether or not a continuous tran¬ 
sition of instances can be found between any two of them; individual instances 
are called points in the first case and elements of the manifold in the second. 
Concepts whose particular instances form a discrete manifold are so numerous 
that some concept can always be found, at least in the more highly developed 
languages, under which any given collection of things can be comprehended 
(and consequently, in the study of discrete quantities, mathematicians could 
unhesitatingly proceed from the principle that given objects are to be regarded 
as all of one kind). On the other hand, opportunities for creating concepts whose 
instances form a continuous manifold occur so seldom in everyday life that color 
and the position of sensible objects are perhaps the only simple concepts whose 
instances form a multiply extended manifold. More frequent opportunities for 
creating and developing these concepts first occur in higher mathematics. 

Particular portions of a manifold, distinguished by a mark or by a boundary, 
are called quanta. Their quantitative comparison is effected in the case of 
discrete quantities by counting, in the case of continuous quantities by measur¬ 
ement. Measuring involves the superposition of the quantities to be compared; it 
therefore requires a means of transporting one quantity to be used as a standard 
for the others. Otherwise, one can compare two quantities only when one is a 
part of the other, and then only as to “more” or “less”，not as to “how much”. 
The investigations which can be carried out in this case form a general division 
of the science of quantity, independent of measurement, where quantities are 
regarded, not as existing independent of position and not as expressible in terms 
of a unit, but as regions in a manifold. Such investigations have become a 
necessity for several paits of mathematics, e.g., for the treatment of many¬ 
valued analytic functions, and the dearth of such studies is one of the principal 
reasons why the celebrated theorem of Abel and the contributions of Lagrange, 
Pfaff and Jacobi to the general theory of differential equations have remained 
unfruitful for so long. From this portion of the science of extended quantity, 
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a portion which proceeds without any further assumptions, it suffices for the 
present purposes to emphasize two points, which will make clear the essential 
characteristic of an nfold extension. The first of these concerns the generation 
of the concept of a multiply extended manifold, the second involves reducing 
position fixing in a given manifold to numerical determinations. 

2 . 

In a concept whose instances form a continuous manifold, if one passes from 
one instance to another in a well-determined way, the instances through which 
one has passed form a simply extended manifold, whose essential characteristic 
is，that from any point in it a continuous movement is possible in only two 
directions, forwards and backwards. If one now imagines that this manifold 
passes to another, completely different one, and once again in a well-determined 
way, that is, so that every point passes to a well-determined point of the other, 
then the instances form, similarly, a doubly extended manifold. In a similar 
way, one obtains a triply extended manifold when one imagines that a doubly 
extended one passes in a well-determined way to a completely different one, and 
it is easy to see how one can continue this construction. If one considers the 
process as one in which the objects vary, instead of regarding the concept as 
fixed, then this construction can be characterized as a synthesis of a variability 
of 凡 + 1 dimensions from a variability of n dimensions and a variability of one 
dimension. 

3. 

I will now show, conversely, how one can break up a variability, whose 
boundary is given, into a variability of one dimension and a variability of lower 
dimension. One considers a piece of a manifold of one dimension — with a fixed 
origin, so that points of it may be compared with one another — varying so 
that for every point of the given manifold it has a definite value, continuously 
changing with this point. In other words, we take within the given manifold a 
continuous function of position, which, moreover, is not constant on any part of 
the manifold. Every system of points where the function has a constant value 
then forms a continuous manifold of fewer dimensions than the given one. These 
manifolds pass continuously from one to another as the function changes; one can 
therefore assume that they all emanate from one of them, and generally speaking 
this will occur in such a way that every point of the first passes to a definite point 
of any other; the exceptional cases, whose investigation is important，need not 
be considered here. In this way, the determination of position in the given 
manifold is reduced to a numerical determination and to the determination of 
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position in a manifold of fewer dimensions. It is now easy to show that this 
manifold has n — l dimensions, if the given manifold is an nfold extension. By 
an ntime repetition of this process, the determination of position in an nfold 
extended manifold is reduced to n numerical determinations, and therefore 
the determination of position in a given manifold is reduced, whenever this is 
possible, to a finite number of numerical determinations. There are, however, 
also manifolds in which the fixing of position requires not a finite number, but 
either an infinite sequence or a continuous manifold of numerical measurements. 
Such manifolds form, e.g.，the possibilities for a function in a given region, the 
possible shapes of a solid figure ， etc. 


II. Metric relations of which a manifold of n dimensions 
is susceptible, on the assumption that lines have a length 
independent of their configuration, so that every line 
can be measured by every other. 

Now that the concept of an n fold extended manifold has been constructed, 
and its essential characteristic has been found in the fact that position fixing 
in the manifold can be reduced to n numerical determinations, there follows, 
as the second of the problems proposed above, an investigation of the metric 
relations of which such a manifold is susceptible, and of the conditions which 
suffice to determine them. These metric relations can be investigated only 
in abstract terms, and their interdependence exhibited only through formulas. 
Under certain assumptions, however, one can resolve them into relations which 
are individually capable of geometric representation, and in this way it becomes 
possible to express the results of calculation geometrically. Thus, although an 
abstract investigation with formulas certainly cannot be avoided, the results can 
be presented in geometric garb. The foundations of both parts of the question 
are contained in the celebrated treatise of Privy Councilor Gauss on curved 
surfaces. 


Measurement requires an independence of quantity from position, which can 
occur in more than one way. The hypothesis which first presents itself, and which 
I shall develop here, is just this, that the length of lines is independent of their 
configuration，so that every line can be measured by every other. If position¬ 
fixing is reduced to numerical determinations, so that the position of a point in the 
given n fold extended manifold is expressed by n varying quantities xi, 吨，抑， 
and so forth up to then specifying a line amounts to giving the quantities a? 
as functions of one variable. The problem then is, to set up a mathematical 
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expression for the length of a line, for which purpose the quantities x must be 
thought of as expressible in units. I will treat this problem only under certain 
restrictions, and I first limit myself to lines in which the ratios of the quantities dx 
— the increments in the quantities x —— vary continuously; one can then regard 
the lines as broken up into elements within which the ratios of the quantities dx 
maybe considered to be constant, and the problem then reduces to setting up a 
general expression for the line element ds at every point, an expression which 
will involve the quantities x and the quantities dx. I assume, secondly, that the 
length of the line element remains unchanged, up to first order, when all the 
points of this line element suffer the same infinitesimal displacement, whereby 
I simply mean that if all the quantities dx increase in the same ratio, the line 
element changes by the same ratio. Under these assumptions, the line element 
can be an arbitrary homogeneous function of the first degree in the quantities 
dx which remains the same when all the quantities dx change sign, and in which 
the arbitrary constants are functions of the quantities x. To find the simplest 
cases, I first seek an expression for the (n — l)fold extended manifolds which 
are everywhere equidistant from the origin of the line element, i.e., I seek a 
continuous function of position which distinguishes them from one another. This 
must either decrease or increase in all directions from the origin; I will assume 
that it increases in all directions and therefore has a minimum at the origin. Then 
if its first and second differential quotients are finite, the first order differential 
must vanish and the second order differential cannot be negative; I assume that 
it is always positive. This differential expression of the second order remains 
constant if ds remains constant and increases quadratically when the quantities 
dx, and thus also ds 7 all increase in the same ratio; it is therefore = constant, ds 2 
and consequently ds = the square root of an everywhere positive homogeneous 
function of the second degree in the quantities dx y in which the coefficients are 
continuous functions of the quantities x. In Space, if one expresses the location 
of a point by rectilinear coordinates, then ds = ^U(dx) 2 ; Space is therefore 
included in this simplest case. The next simplest case would perhaps include 
the manifolds in which the line element can be expressed as the fourth root 
of a differential expression of the fourth degree. Investigation of this more 
general class would actually require no essentially different principles, but it 
would be rather time consuming and throw proportionally little new light on the 
study of Space, especially since the results cannot be expressed geometrically; 
I consequently restrict myself to those manifolds where the line element can be 
expressed by the square root of a differential expression of the second degree. 
One can transform such an expression into another similar one by substituting 
for the n independent variables, functions of n new independent variables. 
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However, one cannot transform any expression into any other in this way; 
for the expression contains - coefficients which are arbitrary functions of 
the independent variables; by the introduction of new variables one can satisfy 
only n conditions, and can therefore make only n of the coefficients equal to 
given quantities. There remain others, already completely determined by 

the nature of the manifold to be represented, and consequently functions 

of position are required to determine its metric relations. Manifolds, like the 
Plane and Space, in which the line element can be brought into the form V Ux 2 
thus constitute only a special case of the manifolds to be investigated here; 
they clearly deserve a special name, and consequently, these manifolds, in 
which the square of the lines element can be expressed as the sum of the 
squares of complete differentials, I propose to call flat. In order to survey 
the essential differences of the manifolds representable in the assumed form, 
it is necessary to eliminate the features depending on the mode of presentation, 
which is accomplished by choosing the variable quantities according to a definite 
principle. 


2 . 

For this purpose, one constructs the system of shortest lines emanating from 
a given point; the position of an arbitrary point can then be determined by the 
initial direction of the shortest line in which it lies, and its distance, in this line, 
from the initial point. It can therefore be expressed by the ratios of the quantities 
dx°, i.e., the quantities dx at the origin of this shortest line, and by the length s 
of this line. In place of the dx° one now introduces linear expressions da formed 
from them in such a way that the initial value of the square of the line element will 
be equal to the sum of the squares of these expressions, so that the independent 
variables are: the quantity s and the ratio of the quantities da. Finally, in place 
of the da choose quantities X 2 , ..., x v proportional to them, but such that 
the sum of their squares equals s 2 . If one introduces these quantities, then 
for infinitely small values of x the square of the line element = Edx 2 , but the 
next order term in its expansion equals a homogeneous expression of the second 
degree in the quantities dx^ — X 2 dx !)，(xi dx^ — X 3 dxi), ..., and is 
consequently an infinitely small quantity of the fourth order, so that one obtains 
a finite quantity if one divides it by the square of the infinitely small triangle 
at whose vertices the variables have the values ( 0 , 0 ， 0 ， •••) ， (xi, 巧，上 ’ 3 …)， 

This quantity remains the same as long as the quantities x and 
dx are contained in the same binary linear forms, or as long as the two shortest 
lines from the initial point to .r and from the initial point to dx remain in the 
same surface element, and therefore depends only on the position and direction 
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of that element. It obviously = zero if the manifold in question is flat ， i.e.，if the 
square of the line element is reducible to Edx 2 , and can therefore be regarded 
as the measure of deviation from flatness in this surface direction at this point. 
When multiplied by — | it becomes equal to the quantity which Privy Councilor 

Gauss has called the curvature of a surface. Previously, n y -^ functions of 
position were found necessary in order to determine the metric relations of 
an 凡 fold extended manifold representable in the assumed form; hence if the 
curvature is given in surface directions at every point, then the metric 
relations of the manifold may be determined, provided only that no identical 
relations can be found between these values, and indeed in general this does not 
occur. The metric relations of these manifolds, in which the line element can be 
represented as the square root of a differential expression of the second degree, 
can thus be expressed in a way completely independent of the choice of the 
varying quantities. A similar path to the same goal could also be taken in those 
manifolds in which the line element is expressed in a less simple way, e.g., by the 
fourth root of a differential expression of the fourth degree. The line element in 
this more general case would not be reducible to the square root of a quadratic 
sum of differential expressions, and therefore in the expression for the square of 
the line element the deviation from flatness would be an infinitely small quantity 
of the second dimension, whereas for the other manifolds it was an infinitely 
small quantity of the fourth dimension. This peculiarity of the latter manifolds 
therefore might well be called plainness in the smallest parts. For present 
purposes, however, the most important peculiarity of these manifolds, on whose 
account alone they have been examined here, is this, that the metric relations 
of the doubly extended ones can be represented geometrically by surfaces and 
those of the multiply extended ones can be reduced to those of the surfaces 
contained within them, which still requires a brief discussion. 

3. 

In the conception of surfaces, the inner metric relations, which involve only 
the lengths of paths within them, are always bound up with the way the surfaces 
are situated with respect to points outside them. We may, however, abstract 
from external relations by considering deformations which leave the lengths of 
lines within the surfaces unaltered, i.e.，by considering arbitrary bendings — 
without stretching* — of such surfaces, and by regarding all surfaces obtained 
from one another in this way as equivalent. Thus, for example, arbitrary 
cylindrical or conical surfaces count as equivalent to a plane, since they can 
be formed from a plane by mere bending, under which the inner metric relations 
remain the same; and all theorems about the plane — hence all of planimetry 
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— retain their validity. On the other hand, they count as essentially different 
from the sphere, which cannot be transformed into the plane without stretching. 
According to the previous investigations, the inner metric relations at every 
point of a doubly extended quantity, if its line element can be expressed as the 
square root of a differential expression of the second degree, which is the case 
with surfaces, is characterized by the curvature. For surfaces, this quantity 
can be given a visual interpretation as the product of the two curvatures of 
the surface at this point, or by the fact that its product with an infinitely 
small triangle formed from shortest lines is, in proportion to the radius, half 
the excess of the sum of its angles over two right angles. The first definition 
would presuppose the theorem that the product of the two radii of curvatures 
is unaltered by mere bendings of a surface, the second, that at each point the 
excess over two right angles of the sum of the angles of any infinitely small 
triangle is proportional to its area. To give a tangible meaning to the curvature 
of an nfold extended manifold at a given point, and in a given surface direction 
through it, we first mention that a shortest line emanating from a point is 
completely determined if its initial direction is given. Consequently we obtain 
a certain surface if we prolong all the initial directions from the given point 
which lie in the given surface element, into shortest lines; and this surface has 
a definite curvature at the given point, which is equal to the curvature of the 
?ifold extended manifold at the given point, in the given surface direction. 

4. 

Before applying these results to Space, it is still necessary to make some 
general considerations about flat manifolds, i.e., about manifolds in which the 
square of the line element can be represented as the sum of squares of complete 
differentials. 

In a flat n fold extended manifold the curvature in every direction, at every 
point, is zero; but according to the preceding investigation, in order to determine 
the metric relations it suffices to know that at each point the curvature is zero 
in independent surface-directions. The manifolds whose curvature is 

everywhere = 0 can be considered as a special case of those manifolds whose 
curvature is everywhere constant. The common character of those manifolds 
whose curvature is constant maybe expressed as follows: figures can be moved 
in them without stretching. For obviously figures could not be freely shifted 
and rotated in them if the curvature were not the same in all directions, at all 
points. On the other hand, the metric properties of the manifold are completely 
determined by the curvature; they are therefore exactly the same in all the 
directions around any one point as in the directions around any other, and thus 
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the same constructions can be effected starting from either; consequently, in the 
manifolds with constant curvature figures maybe given any arbitrary position. 
The metric relations of these manifolds depend only on the value of the curvature, 
and it maybe mentioned, as regards the analytic presentation, that if one denotes 
this value by a, then the expression for the line element can be put in the form 



5. 

The consideration of surfaces with constant curvature may serve for a 
geometric illustration. It is easy to see that the surfaces whose curvature is 
positive can always be rolled onto a sphere whose radius is the reciprocal of the 
curvature; but in order to survey the multiplicity of these surfaces, let one of 
them be given the shape of a sphere, and the others the shape of surfaces of 
rotation which touch it along the equator. The surfaces with greater curvature 
than the sphere will then touch the sphere from inside and take a form like 
the portion of the surface of a ring, which is situated away from the axis; they 
could be rolled upon zones of spheres with smaller radii, but would go round 
more than once. Surfaces with smaller positive curvature are obtained from 
spheres of larger radii by cutting out a portion bounded by two great semi¬ 
circles, and bringing together the cut-lines. The surface of curvature zero will 
be a cylinder standing on the equator; the surfaces with negative curvature will 
touch this cylinder from outside and be formed like the part of the surface of a 
ring which is situated near the axis. If one regards these surfaces as possible 
positions for pieces of surface moving in them, as Space is for bodies, then pieces 
of surface can be moved in all these surfaces without stretching. The surfaces 
with positive curvature can always be so formed that pieces of surface can even 
be moved arbitrarily without bending, namely as spherical surfaces, but those 
with negative curvature cannot. Aside from this independence of position for 
surface pieces, in surfaces with zero curvature there is also an independence of 
position for directions, which does not hold in the other surfaces. 

III. Applications to Space. 


Following these investigations into the determination of the metric relations 
of an 於 fold extended quantity, the conditions may be given which are sufficient 
and necessary for determining the metric relations of Space, if we assume 
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beforehand the independence of lines from configuration and the possibility of 
expressing the line element as the square root of a second order differential 
expression, and thus flatness in the smallest parts. 

First, these conditions may be expressed by saying that the curvature at 
every point equals zero in three surface directions, and thus the metric relations 
of Space are implied if the sum of the angles of a triangle always equals two right 
angles. 

But secondly, if one assumes with Euclid not only the existence of lines 
independently of configuration, but also of bodies, then it follows that the cur¬ 
vature is everywhere constant, and the angle sum in all triangles is determined 
if it is known in one. 

In the third place, finally, instead of assuming the length of lines to be 
independent of place and direction, one might assume that their length and 
direction is independent of place. According to this conception, changes or 
differences in position are complex quantities expressible in three independent 
units. 


2 . 

In the course of the previous considerations, the relations of extension or 
regionality were first distinguished from the metric relations, and it was found 
that different metric relations were conceivable along with the same relations of 
extension; then systems of simple metric specifications were sought by means 
of which the metric relations of Space are completely determined, and from 
which all theorems about it are a necessary consequence. It remains now to 
discuss the question how, to what degree, and to what extent these assumptions 
are borne out by experience. In this connection there is an essential difference 
between mere relations of extension and metric relations, in that among the 
first, where the possible cases form a discrete manifold, the declarations of 
experience are to be sure never completely certain, but they are not inexact, 
while for the second, where the possible cases forma continuous manifold, every 
determination from experience always remains inexact — be the probability 
ever so great that it is nearly exact. This circumstance becomes important when 
these empirical determinations are extended beyond the limits of observation 
into the immeasurably large and the immeasurably small; for the latter may 
obviously become ever more inexact beyond the boundary of observation, but 
not so the former. 

When constructions in Space are extended into the immeasurably large, 
unboundedness is to be distinguished from infinitude; one belongs to relations 
of extension, the other to metric relations. That Space is an unbounded triply 
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extended manifold is an assumption which is employed for every apprehension 
of the external world, by which at every moment the domain of actual perception 
is supplemented, and by which the possible locations of a sought for object 
are constructed; and in these applications it is continually confirmed. The 
unboundedness of space consequently has a greater empirical certainty than 
any experience of the external world. But its infinitude does not in any way 
follow from this; quite to the contrary, Space would necessarily be finite if one 
assumed independence of bodies from position, and thus ascribed to it a constant 
curvature, as long as this curvature had ever so small a positive value. If one 
prolonged the initial directions lying in a surface direction into shortest lines, 
one would obtain an unbounded surface with constant positive curvature, and 
thus a surface which in a flat triply extended manifold would take the form of a 
sphere, and consequently be finite. 


3. 

Questions about the immeasurably large are idle questions for the expla¬ 
nation of Nature. But the situation is quite different with questions about the 
immeasurably small. Upon the exactness with which we pursue phenomena into 
the infinitely small, does our knowledge of their causal connections essentially 
depend. The progress of recent centuries in understanding the mechanisms 
of Nature depends almost entirely on the exactness of construction which has 
become possible through the invention of the analysis of the infinite and through 
the simple principles discovered by Archimedes, Galileo, and Newton, which 
modern physics makes use of. By contrast, in the natural sciences where the 
simple principles for such constructions are still lacking, to discover causal 
connections one pursues phenomenon into the spatially small, just so far as 
the microscope permits. Questions about the metric relations of Space in the 
immeasurably small are thus not idle ones. 

If one assumes that bodies exist independently of position, then the curva¬ 
ture is everywhere constant, and it then follows from astronomical measure¬ 
ments that it cannot be different from zero; or at any rate its reciprocal must be 
an area in comparison with which the range of our telescopes can be neglected. 
But if such an independence of bodies from position does not exist, then one 
cannot draw conclusions about metric relations in the infinitely small from those 
in the large; at every point the curvature can have arbitrary values in three 
directions, provided only that the total curvature of every measurable portion 
of Space is not perceptibly different from zero. Still more complicated relations 
can occur if the line element cannot be represented, as was presupposed, by the 
square root of a differential expression of the second degree. Nowit seems that 
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the empirical notions on which the metric determinations of Space are based, the 
concept of a solid body and that of a light ray，lose their validity in the infinitely 
small; it is therefore quite definitely conceivable that the metric relations of 
Space in the infinitely small do not conform to the hypotheses of geometry; and 
in fact one ought to assume this as soon as it permits a simpler way of explaining 
phenomena. 

The question of the validity of the hypotheses of geometry in the infinitely 
small is connected with the question of the basis for the metric relations of Space. 
In connection with this question, which may indeed still be ranked as part of the 
study of Space，the above remark is applicable，that in a discrete manifold the 
principle of metric relations is already contained in the concept of the manifold， 
but in a continuous one it must come from something else. Therefore, either 
the reality underlying Space must form a discrete manifold，or the basis for the 
metric relations must be sought outside it, in binding forces acting upon it. 

An answer to these questions can be found only by starting from that concep¬ 
tion of phenomena which has hitherto been approved by experience，for which 
Newton laid the foundation, and gradually modifying it under the compulsion 
of facts which cannot be explained by it. Investigations like the one just made， 
which begin from general concepts，can serve only to insure that this work is not 
hindered by unduly restricted concepts, and that progress in comprehending 
the connection of things is not obstructed by traditional prejudices. 

This leads us away into the domain of another science, the realm of physics， 
into which the nature of the present occasion does not allow us to enter. 
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B. WHAT DID RIEMANN SAY? 

Upon a first reading, Riemann’s lecture may appear to have almost no math¬ 
ematical content. But this is only because the analytic investigations, which 
occur in Part II, have been drastically condensed, while Part I explains, in gen¬ 
eral philosophical terms, important mathematical concepts which succeeding 
generations of investigators were eventually able to express with mathematical 
precision; finally, Part III of the lecture deals with applications of the mathe¬ 
matical discoveries to questions in physics, a process which is perhaps not yet 
complete. 

In this commentary on Riemann’s lecture, we will follow closely the order of 
Riemann’s exposition, referring often to the various sections (1， 2， etc.) within 
each part (I, II, III). It should not be expected that all details will be cleared up, 
even in the remaining portions of this chapter, for the complete consideration of 
Riemann’s ideas will occupy several of the succeeding chapters. Consequently， 
the remaining parts of Chapter 4 may be the hardest reading encountered in 
either of the two volumes of these notes. Nevertheless, we hope that in the end 
a dear view of all these ideas will be obtained. 

In the “Plan of the Investigation”，Riemann begins by accounting for the 
confusion over the status of non-Euclidean geometry, which at this time was still 
not completely accepted. In 1829 , Lobachevsky and Bolyai had independently 
constructed a system of geometry which began by assuming that through a 
point not on a line there was more than one line parallel to it (as opposed to the 
assumption that there is only one parallel line, which is equivalent to Euclid’s 
Fifth Postulate); but it was still supposed by some that contradictions in this 
system would eventually be found. 

Riemann attributes the difficulties encountered in the study of non-Euclidean 
geometry to the fact that geometers had never separated what we would call 
the topological properties of space from its metric properties; in the axiomatic 
development of geometry, even the notion of space itself is undefined, and its 
properties are developed through the axioms. 

Riemann proposes to distinguish the metric properties from the topological 
properties, and promises that we will discover how different metric structures 
can be put on the triply extended quantity which constitutes Space, so that one 
cannot possibly expect to deduce the parallel postulate of Euclid from topolog¬ 
ical considerations alone. This implies that experimental data must be used to 
determine what metric properties Space actually has, and raises the question 
which data we should seek, and what we can expect to say about the regions of 
Space too distant, or too small, to be investigated experimentally. 
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In Part I， “Concept of an n fold extended quantity’’，Riemann is clearly trying 
to define a manifold. 

It is impossible to tell from this lecture, intended for non-mathematicians, 
how far Riemann had advanced toward the precise solution of this problem, 
and whether he had any way of expressing concretely the notion of a metric or 
topological space, which is essentially prerequisite to the definition of a manifold. 
However, it is quite obvious that the notion was thoroughly clear in his own 
mind and that he recognized that manifolds were characterized by the fact that 
they are locally like ^-dimensional Euclidean space. It is also clear that he 
understood the importance of infinite dimensional spaces，such as the set of 
all real-valued functions on a space (it is interesting that quite recently some 
of these infinite dimensional spaces have been given the structure of “infinite 
dimensional manifolds”，and differential geometric methods have been applied 
to them with great success). 

Part II contains nearly all the mathematical results，and the discussion of this 
Part will take up most of the present chapter 

The difficulties in Part II begin right with the title, “Metric relations of which 
a manifold of « dimensions is susceptible, on the assumption that lines have a 
length independent of their configuration，so that every line can be measured by 
every other”. To understand what Riemann means, it is necessary to recall the 
process by which lengths are assigned to curves in the plane or 3-dimensional 
space of analytic geometry. In this case，we begin with the notion of distance 
between pairs of points, which amounts to saying that we first assign a length to 
straight lines; the length of other lines is then defined as the least upper bound 
of inscribed curves made up of straight lines, a process which can be reduced 
to integration. In this method of assigning lengths to curves, it may be said that 
all curves are measured by means of straight lines. 

By contrast, Riemann proposes to consider a uniform method of assigning 
lengths to all curves in a manifold, a method which does not depend on first 
distinguishing a particular class of curves. This is to be done by measuring the 
lengths of tangent vectors, so that the lengths of curves can be defined by an 
integral (a restriction to C 1 curves is first indicated). Riemann assumes that 
this “length” function / is continuous on each tangent space and also positive 
homogeneous — the length f(kv) of kv is \k\ times the length f(v) of u. 

Now there are many kinds of positive homogeneous functions on a finite 
dimensional vector space; any subset of the vector space which is symmetric 
with respect to the origin, and intersects each ray through 0 just once, can be 
used as the set of vectors of length 1. Riemann notes that the partial derivative 
of f 2 (with respect to some basis of the tangent space M p ) must vanish at 
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0 € M p , and that the matrix of second order partial derivatives is positive semi¬ 
vectors of length 1 


definite. He then assumes, as the simplest possibility, that it is actually positive 

definite. This means that / can be expressed as y Sij (p) - dx^ for certain 

numbers gij{p). An assignment to each tangent space M p of such a norm, or 
more precisely the inner product from which it comes, is, of course，what we 
now call a Riemannian metric on the manifold M, 

Riemann points out that it is merely to save time, and to allow geometric 
descriptions of the results, that he restricts his attention to the special case. 
Certain more general cases，though not the most general of all, were investigated 
by Finsler in his thesis (1918), and are now known as Finsler metrics; it seems 
clear, however, that Riemann must have already known the basic facts about 
these more general metrics (some information on Finsler metrics is given in the 
Addendum). 

Having restricted his attention to “Riemannian manifolds”，Riemann now 
asks the crucial question: when does the introduction of a new coordinate sys¬ 
tem change the metric ^ Sij dy ( ^)dy J into some given metric ^ a(jdx i ^>dx J ; 
in other words，when are two Riemannian manifolds locally isometric? Rie¬ 
mann here presents one of his famous “counting arguments，，，which enabled 
him to guess results that in some cases were not rigorously proved until a hun¬ 
dred years later. Riemann argues that the expression ^ gijdx 1 (g) dx 】 contains 
functions (not n 2 , for g(j — gji) while a new coordinate system involves 
only n functions, so that we can change only n of the g ”，leaving other 
functions which depend on the metric; consequently, Riemann argues, there 
should be some set of functions which will determine the metric com¬ 

pletely. 

In section 2 of Part II, Riemann indicates how such functions are to be 
found. We are going to apply a standard technique for the study of differentiable 
functions — we examine the Taylor polynomials approximating the functions gij . 
If x is a coordinate system on M, with x(p) = 0, and the Riemannian met¬ 
ric is given by ( ， 〉 = J 2 Sijdx i 0 dx J \ then for the Taylor expansion of the 
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function gij o x -1 : ^ M we have 


n 




gij ox~ l (t) = (gij + 

k = \ 

1 w 

+ ~ Dk ， i(gij 。 x 卜⑼ A’+W| 2 ). 


k、l = \ 


Hence on M we have 




k- 


2 k % dxkdxl 


where o(|x| 2 ) denotes a function f on M such that 


q—p \x{q)\ l 


0 . 


However, and this is the important device Riemann introduces, we will select 
a very special coordinate system around each point p e M. We choose an 
orthonormal basis X\,... ,X n e M p , and define a coordinate system x : M p —> 
M w on M P by x(E^^)= (a 1 ，..., a n ). Then we let x be the coordinate 
system x o exp -1 . (This coordinate system is introduced at the very beginning 



of section 2 , but it takes a little work to decipher Riemann’s description of it.) 

The coordinate system x is not uniquely determined, for it depends on the 
choice of the orthonormal basis X\,... ,X n e M p \ but any two differ by an 
element of ()(«)，so it will not be hard to take into account the way any of our 
results depend on this choice. These coordinate systems are called Riemannian 
normal coordinates at p. Notice that since exp*: (M p )q M p is the identity 
(upon identifying (M p )q with M p ), we have 

— Xf — G p . 
p 

We can quickly give some information about the first two terms in the expan¬ 
sion (*) of gij : 
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1. PROPOSITION. In a Riemannian normal coordinate system x at ^ we 
have 


gij(p) = 
^gij 


dx k 


(p) = o. 


PROOF. The first set of equations is clear, for 


gij(p) 


dx l 


p 


’ dxj 


{Xi, Xj) = 8 ij. 


p, 


To prove the second set of equations, we recall the equations for a geodesic y \ 


dly \tT^)) dyidy] 


dt 2 




dt dt 


0. 


In Riemannian normal coordinates the geodesics through p are just exp o c, 
where c is a straight line in M p . This means that for all w-tuples ( 专 、 ... 
the geodesics through p are the curves y with y k (t) = Hence 


n 


for the geodesic y k (t) = ? 


k 


In particular, since p = y(Qi) is on all these geodesics, we have 


T^j(p)^ l ^ J = 0 for all n-tuples (^ 1 , ... ，专 ”). 

Oi 


This shows that all rf^(p) are 0: choosing all = 0 except = 1 gives = 0; 
then choosing all f a = 0 except = 1 gives 

o = rf ) ⑻ + r>) + ii( P ) + rfj( P ) = 2r^(p). 

It follows that 

n 

[" ，々 ] = ^Sal^ij =0 at p. 
a=l 

Making use of equation (*) on pg. 1 . 331 ， we have finally, 

9 j?" 

= [^J] + [jkj] =0 at p. ♦ 
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In view of Proposition 1 ， we can now use (*) to expand the squared norm 
1 2 as 


gijdx 1 dx } 


ij = \ 


a 2 g" 


— dx dx + - ——j—^-f(p)x k x dx dx J +o(|x| 2 ). 

2 , ox k ox l 

f —1 ij\kj 

[This is an equation for tangent vectors near p, and o(|x| 2 ) now denotes a 
function f on tangent vectors; in order to have 

Hm ^=0, 
q—p \x \ 2 

we must restrict v q to be of some bounded length.] Riemann’s main assertion 
involves the term 

- 3 发 " {p)x k x l dx 1 dx^ — Ci ； j c ix k x l dx 1 say. 

2 . dx k dx l ， 

Riemann asserts that there are numbers such that we can write 

Cij^ix k x l dx 1 dx^ = ^ Cij t j c i(x k dx l — x l dx k ) - (x l dx^ — x)dx 1 ). 
ij\kj ij\kj 


This assertion immediately suggests three questions — Why did Riemann suspect 
this was true? How did he prove it? What is its significance? 

We will begin by giving a partial answer to the third of these questions. No¬ 
tice that the equation in question is supposed to hold for all tangent vectors v 

• 

at all points ^ in a neighborhood of p. Consequently, the numbers dx 1 (u) 
[and x l ((7)] can take on all [sufficiently small] values. The coordinate sys¬ 
tem x and the Riemannian metric { , ) are used to obtain the n 4 numbers 
Cij’ki = \ ^ 2 gij / 3 a^ dx 1 (p); but beyond this, the above equation has nothing 
to do with the manifold at all. If we define a quadratic polynomial Q in In 
variables bv 

r 

2(U) = 2( 不， … ， U，. . .，>；)= [ c ij ， klXiXjY k Y h 

ij：kj 


then Riemann is asserting that this quadratic polynomial can be written as 

Q(X,Y)= C ijM {XiY k - X k Y t ) • {XjYi - XiYj). 

i'j.'k'l 
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To obtain the geometric consequences of this fact, we observe what it says when 

• • 

we select two vectors v p , w p e M p and let X\ = dx l (v p ) and Y\ = dx l {w p )\ 
denoting Q(X, Y) by Q(v p , w p ), we have 

Q{v p ^p) = c iJik i dx i {v p )dx j {v p ) - dx k {w p )dx l {w p ) 

= c ij,kl[(dx l A dx k )(v p , Wp)] - [{dx ] A dx l ){v p , Wp)]. 

[We can also write 

Q = ^ c ij,kl dx l dx } dx k dx l = ^ Qj fd(dx l A dx k ) • (dx^ A dx l ), 
ij；kj 

a little more simply.] Now suppose v f p ,w f p e M p span the same subspace as 
v p , Wp, so that we can write 

v f p = a\\V p -\-a2\Wn 
, det(a")#0. 

切 p = ^I2^p "I - ^22^p 

The right side of the above equation for Q(v p , w p ) shows that 

QW p ,w f p ) = [det(a")] 2 . Q{v p ,w p ), 

since each dx a A dx^ is multiplied by the factor det(«/ 7 *). If we use \\v p , w p \\ to 
denote the area of the parallelogram spanned by v p and w p , then we also have 

lli/pu/p" 2 = [det(a")] 2 . \\v p ,Wp\\ 2 . 

Consequently, 

Q(v f p,w f p ) _ QjVp^Wp) 

\Wp^ f p \\ 2 ~ \\v p ,W p \\ 2 ' 

We therefore have a way of assigning a number to every 2-dimensional subspace 
of the tangent space at p. (Riemann sticks to the original quadratic function 
of the x l and dx\ which puts him in the position of having to divide by the 
squared area of a very strange triangle, with one vertex at x\ and one at dx l .) 

It is easy to see that if \ve pick a different Riemannian normal coordinate sys¬ 
tem at p, then the resulting function on the 2-dimensional subspaces of M p will 
be the same, for Q(v p , w p ) = Q{dx l {v p ),... ,dx n (v p ),dx i (w p ),... ,dx n {w p )) 
will change by (det B) 2 , where B e 0 ("), so that det B = dzl. We will ex¬ 
amine later the significance of this new function on 2-dimensional subspaces of 
the tangent space. For the present we take up the other questions — Why did 
Riemann think it was true, and how did he prove it? 
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Of course，an answer to the first question is not only doomed to be mere 
conjecture, but is always foolhardy to put forth, for there is no accounting for 
genius. The best suggestion I can offer is that the dependence of Q(v P7 w p ) on 
the span of v p and w p alone is certainly an attractive one，and as we shall see 
later, in one special case which Riemann may have investigated first, the result 
appears in a rather natural way. It is also impossible to say for sure how Rie¬ 
mann proved the result, for his own investigations were never published. I have 
used the remarks by H. Weber in Riemann’s collected works (pp. 405 — 409 ), 
as well as the commentary given by Herman Weyl in a special edition of Rie- 
mann’s lecture. There are two parts to the proof, a purely algebraic one about 
quadratic functions, which determines what relations the numbers ought 
to satisfy, and an analytic one which establishes these relations. 

For the algebraic part, we will be considering a quadratic function Q of 2 n 
variables 

Q(X 7 Y)^Q(X l 7 ,,, 7 X n ,Y u ... 7 Y n )= J 2 Cij^XiXjYkYi. 

Note that for our Q we have 

c ij\kl = c ji,kl — c ij ， lk ， 

(using gij = gji, and d 2 /dx k dx l = d 2 /dx l dx k ). A = (a") is a 2 x 2 matrix, 

we will use A{X^ Y) to denote the -tuple 

A(X 7 Y) = (a\\X + G 2 \ Y, a\ 2 ^ + cinY) 

= {a\\X\ + U 2 \ Y \, ..., a\\X n - ai\ Y n , anX\ + cinYi ，…，+ ^22^)- 

2 . PROPOSITION. Let 2 be a quadratic function of In variables ， 

Q{X 7 Y)= Cij^XiXjYkYi, 

ij\kj 

where 

(1) c ij,kl = c ji,kl — c ij ， lk. 

Then 

Q(A(X,Y)) ^(det A ) 2 Q{X,Y) 
for all 2 x 2 matrices A if and only if: 

( 2 ) 

⑶ 


c ij,kl — c kl,ij 
Clijk + Hj,ki + ^lk,ij = 0. 



The Curvature of Higher Dimensional Manifolds 


171 


PROOF. First of all, the equation Q(A(X, Y)) = (det A) 2 Q(X, Y) clearly holds 
for all 2 x 2 matrices A if and only if it holds for the non-singular ones，since 
both sides of the equation are continuous functions of and the non-singular 
matrices are dense. 

Now it is well known that all non-singular 2x2 matrices can be written as a 
product of the matrices 



[This comes from the fact that any non-singular matrix can be obtained from 
the identity matrix by a sequence of elementary row operations, and every row 
operation may be accomplished by multiplying by one of the above matrices.] 
So our condition holds for all A if and only if it holds for the above matrices. 
We can disregard the last matrix, since 



For the matrix ^4 = ^ ?) ， the condition Q(A(X, K)) = (det A) 2 Q(X, Y) 

becomes simply 

Q(aX,Y)=a 2 Q(X,Yl 


which is automatically true. The same result holds for the second matrix on 
our list, so all the conditions finally come down to 


( a ) 


(b) 


Q(Y,X) = Q(X,Y) 
Q{X + Y,Y)=Q(X,Y) 




Now equation (a) becomes 


^ Cij.kiXiXjYkYi = ^ Cjj^iYiYj X k Xi. 


Since this must be a polynomial identity，we obtain ( 2 ) immediately, by looking 
at the coefficient of XfXjY^Yi on both sides. 

Equation (b) becomes 

[ + ^f)(^7 + = [ Cij^iXfXjYicYi, 
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or 

c U,kl [^/ Yj Y k Yi + ^ Yt Y k Yi + Yt Yj ^K/]-0. 

Letting A" = 0, we obtain 

(bl) ^2 c U^l Y i Y J Y ^ Y l ~ 0 

and then, in consequence ， 

(b2) J2 CijM^yjYkYi + XjY^Y t ] = 0. 

ij\kj 

On the other hand ， (b2) implies (bl)，so (b) is equivalent to (b2) alone. Finally ， 
since c/y^/ = Cji,kh equation (b2) is equivalent to 

(b3) [ dj^XiYjYkY^O. 

Looking at the coefficient of a particular XiYjY^Yi we obtain 

Cij，kl + ^ijjk ~l~ Cik，jl ^ikjj ~l~ Q/,yA: ~l~ ^il,kj — 

Using the symmetry with respect to the last two indices, this is equivalent to 
equation (3) ‘ ♦ 

3. PROPOSITION. A quadratic function 

Q(X,Y)= J2 c ijM XiXjY k Yi 

with 

(1) = Cji.kl = CijMl 

satisfies the two equivalent conditions of Proposition 2 if and only if it can be 
written as 

Q(X^Y)= J2 c uM^Y k - X k Yt) - (XjYi - XtYj)^ 
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PROOF. If Q can be written this way, then we will clearly have Q(A(X, Y))= 
(det A) 2 Q{X^ Y) for all A. Conversely，suppose this holds for all A, so that we 
also have 

(2) ~ c kl，ij 

(3) Ciij k + Cij^i + Clkjj = 0 . 

We begin by writing four equivalent expressions for Q: 

Q(X y Y) = ^2c iJikl X i X j Y k Yi 

= [ CjkjiXjX k YiYi 

= ^2c ilJk X i X l Y J Y k 
= ^2c fcUj X k X l Y i Yj. 

Now, by (3) we have 

c jkjl = —Cjijk — c jl ， ki ， 

so 

Cjk ， ilXjX k YiYi = — ^ cy^ikXjX k YiYi - ^ Cji^iXjX k YiYi 

cjutcXjX.YiY!- ^ cujiXjX.YtYi 

(interchanging j and k in the second sum) 
=~ 2 Y1 CijjciXjXfcYiY !， using ( 1 ) and ( 2 ). 

If we apply a similar process to the third expression for Q, use (2) on the fourth, 
and leave the first unaltered, we obtain 

Q{X,Y)^Y, c UM^i^jYkYi 

^Q(X,Y) = -^2c ij ^ kl XjX k Y i Yi 

\Q{X,Y) = -Y, Cij^iXiXfYjY,, 

Q(X,Y) = ^2c Uikl X k X l Y i Y j . 

Adding，we obtain the desired result, 

3Q(X,Y)= ^2 c ih ki(XiY k - Xf.Yt) - {XjYt - XiYj). ❖ 

iJ'MJ 
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We now proceed with the hardest part of the investigation, a hairy calculation 
indeed. 


4. PROPOSITION. In a Riemannian normal coordinate system x at the 
numbers 

_ 1 沪卽 r 、 

CiUkl = 2 dM (p) 

satisfy 


Cli，jk + Clj、ki 七 Clkjj — 0 . 

PROOF. We begin with an equation derived in the proof of Proposition 1. For 
the geodesic y k {t) — ^ k t we have 

n 

J2 1 ⑽ ))rv’ = o; 

ij = \ 


multiplying by t\ we have 

n 

J 2 lt_( K (0)?’(K ⑴ ⑴ ） =0. 

ij=\ 

Since these geodesics go through all points in a neighborhood of we have 
the following relation between the functions and x l : 

n 

⑴ J2 二 o. 

/j=i 


Since the tangent vector to the geodesic y k (t) = ^ k t has constant length, we 
also obtain 

dy dy 


dt ’ dt 


W) 2 , 


which leads, in the same way, to the equation 


( 2 ) 


如 w = ZX'’) 2 _ 
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Now equation (1) leads to 

n 

X ] ["，众]?汐 = 0， 

i ， j=i 

i.e.，to 

^ 2 V dx } dx l dx k ) 

= i 

Interchanging the indices i and j in the second term, we can write 

( 3 ) t 


Ogik 1 Ogi 


J 


dx^ 2 dx k 


xx 


j 


0 . 


Our penultimate goal is to break this equation up into two sums，each of which 
is individually 0; the conditions on the c’s，which are our ultimate goal，will then 
follow fairly easily. To achieve this, our antepenultimate goal is to prove that 
S^a x(X y these equations are at least reasonable，for they imply (2). To 
prove these relations, we first introduce the functions defined by 


Note that 


n 

^ 私 ？. 

oc=l 


dx 替 


dxy 




dxy 


x<X + 


Substituting in (3)，we obtain 
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Now by ( 2 ) and the definition of x\ we have 



so we obtain 



— X 


k 



This equation shows that along any geodesic y{t) = ^ l t we have 

d[x k - x k ]{y{t)) n 

- It - =0 ， 

so that x k — x k is constant along the geodesic. Since gij(p) = & 7 , we clearly 
have x k {p) — x k {p). Moreover，these geodesics pass through all points in a 
neighborhood of p. Thus x k = in a neighborhood of p，so that we finally 
obtain the desired equations 


n 


( 4 ) 


gk(xX<x 


X 


k 


a 


Now we differentiate (4) to obtain 


n 


E 


+ gki = hi 


a: 


multiplying by x l and summing, we obtain 


[ ~^T x<Xxi = + 


aA=\ 


which, together with (4) gives 




aJ=l 


dx l 


—x k + x k — 0 . 
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and we have thus obtained the first part of our penultimate goal ， 


E 


^gik 


XX 


Together with (3)，it implies the other part, 


^ 3^7 


E 


dx k 


1 = 0 . 


We now obtain the desired equations as follows. Along the geodesic y k {t) 
we have，by ⑹， 


史磬 (，_ 奶 2=0 . 


This implies that 


E 磬⑽ )) ⑼ = 0 


for t ^ 0, and hence even for ^ = 0, by continuity. Differentiating (7) with 
respect to t gives 

°=t 昝 _ 砂 2 t+ ± W， 2 

/ =1 / i 1 — \ 


i 為 (離 


i ， jj' 


by (8); 


consequently, 




E 


i ， j ， l •- 


dx k dx l 


(ym l •押 


for all t and hence also for ^ = 0. Setting f = 0, we obtain 

名為 7( 虐 =。. 
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This equation holds for all ^-tuples f 1 ，...From this we easily derive 

(A) Cij\u + cujk + cjijk = 0. 

Applying the same process to (5)，we obtain 

(B) Cktji + c k jji + c kU j = 0. 

In (B) we interchange k and /， to obtain 

Cli'jk + + Cfd jj = Q. 

Comparing this equation with (A), we obtain the first of the desired relations, 

^ij,kl — c kljj. 

Moreover, using this relation with either (A) or (B) ? we obtain the second of the 
desired relations, 

Cli，jk + ^Ikjj = 

And thus we are done! ♦ 


When we put all these results together we see that the quadratic function 

1 • • • • 

Q(Vpy Wp ) = - E 3 k ^ J l dx l (v p ) dx J (v p ) dx k {w p ) dx l (w p ) 

ij\kj 

can be written 



We thus see that the quadratic function Q, obtained from the Taylor expan¬ 
sion of || || 2 in Riemannian normal coordinates, has special properties which 
allow us to define, for any 2-dimensional subspace W C M p , a number 


Q(W) 


Q(Vp ， W p ) 
\\Vp.Wp \ 2 


v p ，Wp any basis for W, 


The work of the last four Propositions, which establishes this fact，is completely 
suppressed in Riemann^s account, where the final result is merely stated, at the 
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beginning of section 2 of Part II. Riemann then makes some remarkable claims. 
First, Riemann interprets Q for a surface: 

(1) If M is 2-dimensional and W = M p , then ~^Q{W) is just the Gaussian 
curvature K{p) given by Theorem 3-7; we thus have an intrinsic def¬ 
inition of K, obtained by picking a special class of coordinate systems 
determined by the metric. (Riemann needs the factor —3/4 because he 
divides Q(v p , w p ) by the square of the area of the triangle spanned by v p 
and w p .) 

At the end of section 2, Riemann interprets Q for an «-manifold: 

(2) If M is «-dimensional, W CZ A/p is 3 . 2 ~<dimension3.1 subspa.cc, 3.n<d (9 CZ 
W is a. neighborhood of 0 € W 7 on which exp is a diffeomorphism, then 
—3 2( WO is the Gaussian curvature at p of the surface exp(0), with the 
metric it inherits as a submanifold of M. 

But the most important claim is made in section 2. In an «-dimensional vector 
space there are “independent” 2-dimensional subspaces: if ,..., is a 
basis, we can choose the subspaces spanned by Vi and Vj, for i < j. Riemann 
claims that the metric ( , ) is determined if Q{W) is known for indepen¬ 
dent 2-dimensional subspaces W C M q at each point for example, if Q is 
known for the subspaces spanned by each d/dx l \ q and d/dx J \ q (i < j). 

A very special case of this general claim is the following, which we will hence¬ 
forth call the Test Case ： 

(3) If M is «-dimensional and Q — ^ for independent 2-dimensional 
subspaces of each M q , then M is flat, that is, M is locally isometric to 
with its usual inner product. 

In connection with the Test Case, it should be pointed out that a local isometry 
with is the best we can hope for, since there are Riemannian manifolds 
which are not homeomorphic to R rt , but which are locally isometric to R rt , and 
hence have Q — ^ everywhere. The simplest example of such a manifold, the 
“flat torus”，is constructed as follows. The torus T can be obtained from R 2 by 
identifying (.v, v) with (a 、 vO if and only if 

J’ — J ， — A* € Z 

(compare pg. 1.372). The map 7T: R 2 -> T\ defined by taking (x, j) to its 
equivalence class, is locally a diffeomorphism, and there is clearly a unique 
metric ( , ) on 7" such that n*{ , ) is the usual Riemannian metric on M 2 ; 
consequently, (7\ 〈， 〉）is locally isometric to M 2 with its usual Riemannian 
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metric. Notice that the usual torus in IR 3 , with the induced Riemannian metric, 
is not flat; it has positive Gaussian curvature on the part furthest from the axis, 



and negative Gaussian curvature on the part nearest the axis. However, if we 
consider 5 1 C M 2 , then it is easy to see that 5 1 x 5 1 C M 2 x R 2 , with the 
induced Riemannian metric, is flat. 

One other remark should probably be made about the Test Case. At first 
sight, the Test Case might seem to be little more than a theorem about func¬ 
tions whose second partial derivatives are everywhere zero. However, it is ac¬ 
tually quite different from this simple sort of result, since the value of Q at 
different points is defined in terms of different coordinate systems, each chosen 
specifically for one point. 

Now our aim in the rest of this chapter is to prove assertions (1), (2)，and (3). 
(The general claim that Q determines the metric will be considered later, as will 
the information given in sections 4 and 5 of Part II.) However, we will defer the 
proofs of assertions (1) ，（ 2)，and (3) to another section of this chapter，not only 
in order to provide ourselves with a brief respite, but also to allow Riemann to 
add one or two more brilliant ideas. 
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C. A PRIZE ESSAY 

The second edition of Riemann’s collected works includes an unpublished 
paper, in Latin, which was submitted to the Paris Academy in 1861 , to compete 
for a prize on a question involving heat conduction. In 1868 , ten years after it 
had been offered, the prize was finally withdrawn. Because the way of obtaining 
the results of his essay were not fully explained, the prize was not awarded to 
Riemann, whose health prevented the more detailed handling of the subject 
which he had intended. 

An extract from this paper is given below.* It should not he very hard to 
read, hut the significance of the equations obtained there is only suggested by 
Riemann’s final remarks; in the next part of this chapter we will have a great deal 
more to add. In the translation I have made some minor changes of notation. 


An Extract From Riemann’s Paper of 1861 


Second Part 

On the transformation of the expression [ 9ij dy l dy 1 

ij 

into the given form X! a ij doc 1 dx J • 

When the inquiry of the third Academy is restricted to homogeneous bodies, 
in which the resulting conductivities are constants, we develop the first condition 
that the expression [ gij dy 1 dy J , in which the y l are functions of the x l , can be 

transformed into the form dx 1 dx J with given constant coefficients a[j. 

ij 

The expression dx 1 dx J , if it is, as we shall suppose, a positive form in 
the dx\ can always be put in the simplified form (dx 1 ) 2 . Thus if YlOi) ^ ^ 

can be transformed into the form J2 a ij^ xl ^ can likewise be reduced to 

ij 

the fomi Yl (dx 1 ) 2, and vice versa. We therefore ask whether it can be put in the 

i 

form E (dx 1 ) 2 . 


* Certain omissions, indicated by are considered in Addendum 2 to Chapter 6. 
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Let G — det (gij) and let yij be the cofactor; in this way = G and 

i 

^ dijlfik = 0 if J < /c. 

i 

If Y^gijd^dy] = idx 1 ) 2, for arbitrary values of the dx 1 ，substituting 

■ _ • 

hj i 

d 8 for d leads also to [ dy 1 8y J = dx l 8x^ for arbitrary values of the 

ij i 

_ _ 
dx l and dx l . 

Consequently, if the dy 1 are expressed in terms of the dx l and the in 
terms of the dy 1 , it follows that 


⑴ 

dx^ 

dy a 

and consequently 


⑵ 

■ 

dx^ 

Thus we further deduce, seeing that 

y-s dyi dx a i 
dx a 

(Y u 

and 


> : 9ai 
i 


8y l 

dx^ 


Jai 

G dy a 

ry ^ 




dy 1 dx a o 
dx a dy] 



⑶ 


I ： 


dx a dx a 


9ij 


dy 1 dy) 

and differentiating* formula (3), 


⑷ 


3y l 3y j _7ij 


a 


dx a dx a G 




d^x a dx a 




d^x a dx a c)gij 


a 


dy } dy k dy) — dy^dy k dy 1 dy k 


Now from these expressions for 


we can write 

⑸ 


d 9U 

3y k 


d 9jk 
dyj ’ dyi 


21 ： 

a 


d 2 x a dx a 
dy)dy k dif 


d 9ij + ^9^ _ ^9jk 
dyk dy] dy } 
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and if these quantities are designated by then 


⑹ 


2 


d^x a 
dy^dy k 


E 


dy l 

dx a 


Pijfc 


Differentiating the quantities again yields 


dyf dy k ^ dyidyl 




d 2 x v #% v 


dy〕dyl dy l dy k 


whence finally, substituting the values found in (6) and (4 )， 


(I) 


^ 2 9ik , d2 9j! d 2 g u ^ 2 9jk 


+ O (PajlPlSik - PailPf3jk)y^ 


dy^dy 1 dy i dy k dy^dy k dy l dy l 2 


a，/3 


G 


The functions g^j must necessarily satisfy these equations whenever 

[gijdy 1 dy、’ can be transformed into the form J](c?x 2 ) 2 : we denote the left 
• • _ 

hj i 

side of this equation by 

(ijMl 

•… Given an acquaintance with the traditional methods, it is demonstrated 
without difficulty that these ... conditions when they are satisfied, suffice_ 
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D. THE BIRTH OF THE 
RIEMANN CURVATURE TENSOR 


All the developments in this part of the chapter have their origin in the fol¬ 
lowing question, which Riemann considers in the paper of Part C : When is a 
Riemannian manifold (M, ( , )) flat (locally isometric to M w with its usual Rie- 
mannian metric)? In other words, when is there a coordinate system x 1 ,..., 
on M for which 

n 

(1) 〈，〉 = ^ dx l (g) dx l ? 

/ =1 


We are going to seek an answer to this question in as straightforward a manner 
as possible; the quadratic function Q will not be used at all, but at the end it 
will make a surprise appearance. 

We begin by choosing an arbitrary coordinate system y, in terms of which 
the metric ( , ) can be written 

n 

(2) 〈，〉 = E gij dy l (8) dy j ; 

ij=l 


and we then seek conditions on the gij in order for (1) to hold for some co¬ 
ordinate system x. Since this is a purely local question, we can assume that 

少 1 ， ... ，少 n is just the standard coordinate system on M w . 

• » 

If we express the dx l in terms of the dy J ， and equate the coefficients of 
dy l (g) dy } in (2) with the resulting coefficients in (1)，we find that the coordinate 
system x 1 ,..., has the desired property if and only if 


( 3 ) 


I ： 


dx a dx a 


a 


dy l dyj 



From equation (3) we can immediately derive another, for we obtain 


j •爲 


dyj dy k 
3.v a 3 .y 卢 


=E 

}^,0t 


dx a dx a 3 v 7 * dy k 
dy' 1 dy-^ 3a^ 


▽ ^ X<x gq 3y k 


a, 卢 


E 


a 


dx a dy k 

ay 9x« 


'k 
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which shows that if the coordinate system x\ .,. ,x n has the desired property, 
then 


(4) 


d y l d y J = 2 u 

^ dx^ 


Conversely, (4) implies (3). These results are just the equations (3) and (4) that 
Riemann obtains. Notice that Riemann begins with the square of the norm 
|| || 2 = Ylg" dyi • dy ) ， and then uses polarization to obtain the inner prod¬ 
uct, which he writes as Sij dy l 8y J . Riemann also treats the two coordinate 
systems x and y on an equal footing throughout, so that his derivations of (3) 
and (4) are somewhat different. From (4) we obtain 


- 


i. 


dy l 


E 




dy l dy^ dx^ dx v 

: 3x0 9〆3〆 




and thus the coordinate system x l ,... ,x n has the desired property if and only 
if 



jy j 



dx^ dx v 
dy l dy^ 


= 


This equation, which we will find more useful than (4)，can be derived directly 
from (3) in the following way. A = (a ”） = (dx l /dy J ), and G = (gij), then (3) 
says that 

A 1 A = G, 

where A 1 is the transpose of A; this is equivalent to 

G— 1 = A~ l • (A^ 1 , 

and hence to 

火 G—= /， 

which is just (4 / ). In particular, this shows immediately that (4 r ) is equivalent 
to (3). 

Now equation (3) is a partial differential equation for the functions x a . In 
Chapter 1.6 we developed a general theory for partial differential equations, 
but we notice at once that (3) is not an equation of the type to which our theory 
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applies. Our first task will thus be to obtain from (3) an equation that we 
do know how to handle. The situation is very much like, and may profitably 
be compared to, that which occurs in Problem 1.7-19, where the analysis of a 
certain set of partial differential equations is reduced to Theorem 1.6-1，together 
with the Poincare Lemma. To treat equation (3)，we begin by differentiating 
(about all we can do), to obtain 


E 


d 2 x a dx a 

dy^y^ 


+ E 

a 


d 2 x a dx a 
dy^dy k dy l 


^gij 


By writing down this equation for 

3g/y d Sik hjk 
dy k 


dy^ 5 dy l 


and combining, we obtain an equation equivalent to Riemann’s, 

/r . d 2 x a dx a 1 (dgij dg^ dgj^ \ 

⑶ [ d y Jdy k ay = 2 V3 / + ^ ~ "9^") = [片 


Thus, the symbols [y/c, which came up naturally in the calculus of variations, 
also come up naturally in this different con text. After Riemann’s Habilitations 
lecture was published, in 1866 ， several mathematicians independently derived 
his results or considered related questions. Christoffel，in particular, introduced 
these combinations of the partial derivatives of the gij and the symbols [ij\ k] 
and r^- are called the Christoffel symbols of the first and second kinds ， re¬ 
spectively (Christoffel ac tually used [^] and { 父 }， which do not accommodate 
themselves to the summation convention). In the next chapter we will see one 
important use which Christoffel made of these symbols. 

At this point we will depart slightly from Riemann’s treatment, in order to 
obtain equations to which Theorem 1. 6-1 directly applies. From (5) \ve obtain 



E 


3 2 .\ a dx a dx x iy 
dyJdy^ 3 V* 3 v y ^ 


32 / a.V tt 3 A^ - y \ 

^ 3W3 ― 3v z dy y ) 

= E by (4 ’)， 

/V • ^ 
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so we obtain, finally ， 


( 6 ) 


3 2 x 入 


n y dX X 


E 


dyj dy^ ^ dy y 


(which is easily seen to be equivalent to Riemann’s equation ( 6 )); we will also 
write this equation as 

3x 入 

I 

3x 入 


dy k 


E 


'Y 


y 




Notice that the index k plays no special role here; all functions satisfy the 
same equation. Thus, for each X the n-tuple of functions 


a 


dx^ dx^ 

㈤…，^ 


a: R w 


satisfies the set of partial differential equations 


(*) 


da 

dy k 


(y) = fk{yMy)), 


where W 1 x W 1 M rt is given by 


n 




y 


y ： 


Since this is true for every X, the equation (*) has n solutions whose initial values 
at some point ，0 say，are linearly independent. Since constant linear combina¬ 
tions of solutions of (*) are also solutions, it follows that (*) has solutions with 
arbitrary initial conditions at 0 . From Theorem 1.6-1 we thus obtain necessary 
integrability conditions ， 


d A-^L 

dv l dv k 








In our case, looking at the 7 th components of these equations, vve obtain 


« r)ry 

V jk J 

乙、 dy l 匕 3 v 々 

y~l / - 


-E& 

j/=l - M 


n 


n 


n 


n 




y = 0 . 


y- 




y- 
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Since these relations must hold for all z = (z 1 ,..., z rt ), we obtain 

3r, y 


氺氺 


0 = R y i 


jlk 


def ^lj 


dy l dy 


k 


ixmw) 




as necessary conditions that g" dy 1 ^ dx l <2>dx l for some coordinate 

system x = {x\ , x n ). Notice that the set of equations R Y jik = 0 is equivalent 

to the set of equations 

n 

Rijlk = f y^,Siy RY ilk =0. 

y = l 


The quantities Rijik can be expressed in another way, after a little calculation. 
Note first that 


J 2 gi y 

y=l 



a 少 




V=1 7 

3[jkJ] 
dy 1 


v 


dgi 


Y 


_I] r A([",K] + [W，0). 


Y- 


Substituting into (**), and remembering the definition of [ij,k], we obtain 

( d 2 gik , d 2 gji d 2 gu d 2 g jk 


(***) 


R ijlk 


2 \ dyj dy 1 dy 1 dy k dy^ dy k dy i dy 1 


n 

a, 卢 =1 


The condition Rijik = 0 is just the condition (I) which Riemann obtains (note 
that Riemann’s pij^ equals 2[jk, /]) — the quantity which we have denoted by 
Rijik is what Riemann denotes by 2{ij\kl); the factor of 2 is not particularly 
significant, nor is the interchange of I and /c, for it is easily seen that Rijik = 

—Rijkl- 

The notation R l jki has been picked in anticipation of the following result. 


5. PROPOSITION. On a Riemannian manifold (M, { , )) there is a tensor 
of type (j) whose components in any coordinate system y are 




jkl 




3 ； 


、k 


3 V 7 


n 






(where { , ) — [ g" dy 1 (g) dv J , and the Christoffel symbols T are defined as 
usual). 
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PROOF. We just compute that the components transform “correctly ”!！ In other 
words, if R fl jik are defined by the same formula, with respect to the coordinate 
system we show that 

p/a V Pi dyJ dyfa 

R dyfi dyy dy^ * 

To do this, all one needs is the result from Problem 1.9-22, 

r y 二 y ^ ay W a 广 士 a 2 严 a ， 

^ dy fa dy f ^ dy^ dy fct dy f ^ 3 少 〆 ’ 

i,j，k /i=l 

and plenty of perseverance. 

SLIGHTLY MORE MOTIVATED PROOF. Begin with the equation 

^gij dy l ® dy j = { , ) = ^2,g t ijdy fi ®dy fJ , 

« « « « 

i, j i, j 

and repeat the whole sequence of computations which we performed in the 
special case that g、j 二 8". The result will be the desired transformation law 
(The integrability conditions (**) then follow as a necessary condition for the 
existence of a coordinate system y ! with g f " = 8 " ， for in such a coordinate 
system we clearly have R n jkl ~ which in turn implies that all R l jkl = 0.) 


We have thus stumbled onto a new tensor, the Riemann curvature tensor, 
which in the coordinate system y equals 

R l jki dy j ® dy k ® 

Eventually we hope to have a useful invariant definition of this tensor; this will 
involve an enormous amount of exploration. For the time being, we simply ac¬ 
cept the classical definition, which arises naturally as an integrability condition, 
and explain how it is connected with curvature. In the process we will obtain 
an invariant, but extraordinarily clumsy, definition of the curvature tensor. 

It will be convenient to introduce a bit of modern terminology, and denote 
by R the tensor with components R l jkl- Since this tensor is of type (}) it may 
be regarded as a function taking three vectors to another vector The value 
of R on Z e M p will be denoted by 



R(Y,Z)X eM p , 
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and hence we have 

R 


a 少 


k 


a 

\ 3 

/ ay 

p V 


n 


P 


J2 Ri m{p) 


dy l 


p 


(the reason for choosing the notation R(Y, Z)X comes out in Proposition 6). 
The numbers Rijki = giy R Y jkl are also the components of a tensor，of 


y 


type ( 0 )，but it is unnecessary to perform any calculations to verify this. Clearly 

a 


^ijkl(p) 



dy k 


a 

\ 3 

a 

/ay 

p d y J 

/ 3/ 


P t 


so the tensor in question is just the multilinear map 

(X, r, Z, W) (R(Z, W)Y,X). 


This function of four tangent vectors is closely connected with the quadratic 
function introduced in Part B of this chapter: 


6. PROPOSITION. Let x be a Riemannian normal coordinate system at /?, 
and Q the quadratic function on M p x M p defined by 


where 


Then 


PROOF. 


Q(X,Y)= Y, C UM dx\X) dx J (X) dx k (Y) dx l (Y), 




—1 ^ 2 gij 

CijM " 2d^J' 

Q(X,Y) = - l -{R(X,Y)Y,X). 
We have seen that 


30(m = ^ Cij^i{dx l A dx k ) • (dx J A dx l ){X, Y) 


— ^ Cij'fci dx l {X) dx\X) dx k {Y) dx l {Y) 

ij.kj 

+ X] Ci J^ kl dx l {X) dx l (Y) dx^(Y) 

— ^ Qy^/ dx\X) dx k {X) dx l (Y) dx l (Y) 

i 丄 k，l 

- dx\X) dx l {X) dx j (Y) dx k (Y). 

ij.kj 
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By switching indices we can rewrite this as 


3Q(X,Y)= dx ^ X ) dxJ ( r ) dx k {X) dx l {Y) 

+ C j^ ik ” 

~ XI Cil ^ k ” 

iJ,kJ 

- C j k ， il ” 

ij,kj 


[interchange 
j and k] 


[interchange 
i and /] 

[change i to /; 
j to /; / to j] 

[change i to k\ 
k to l\ l to /] 


二 〉: + Cjijk — Cjij^ — Cj^ ii) dx^ ® dx^ ® dx^ ® dx Y). 


Now in Riemannian normal coordinates, the Christoffel symbols [ij,k] are all 0 
at p, since all dgij/dx k are 0 at p. Referring to equation (***) we thus have 


3Q(X, Y) = ^ Rijik(p) dx 1 ® dx } ® dx k ® dx l {X, Y, X, Y) 

= — ^ Rijki(p) dx 1 ® dx } ® dx k ® dx l {X, Y, X, Y) 
^~{R(X,Y)Y,X). ❖ 


We are now ready to verify some of Riemann’s claims. 


7_ PROPOSITION. Let (M, { , )) be a 2-dimensional Riemannian manifold, 
and let X,Y e M p be linearly independent. Let \\X, Y\\ denote the area of the 
parallelogram spanned by X and Y. Then 


K(p ) 二 


{R(X^Y)Y,X) 

IIUli 2 


[= {R(X,Y)Y,X) if X and Y 
are orthornomal] 


is the same as the Gaussian curvature at p defined by the formula in Theo¬ 
rem 3-7 (in particular, this proves that the formula in Theorem 3-7 is indeed 
independent of the coordinate system), 

FIRST PROOF. Let (x, y) be a coordinate system on a neighborhood of p. It 
obviously suffices to verify the theorem when X = d/dx\ p and Y = 3/dy\ p , 
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since by Proposition 6, and the results of Part B, the numerator is multiplied 
by the same factor as the denominator when we change to any other pair of 
vectors. In this case, 


(R(x, 


R 



= 及 1212(/0. 


If we write 


< ， )=E dx ^ dx F dx ^ dy F dy ^ dx G dy ^ dy. 


so that 


gn = E ■ 

gn = ^21 = F 
gn = G, 

then (by the formula on pg. 1.308) 

ll^ril 2 = EG - F 2 , 

so we must prove that ‘ 

4R nn (EG - F 2 ) = 4(EG - F 2 ) 2 K, 

where the right side is given by the formula in Theorem 3-7. This is a fairly 
straightforward calculation from (***) on page 188. The first term in (***) 
corresponds to the last in the formula for 4(EG — F 2 ) 2 K, and the second cor¬ 
responds to the first three in the latter formula. In carrying out the calculation, 
note that 


G 


g 


EG - F 2 


g l2 = g 21 


F 


EG - F 2 


g 


22 


E 


EG-F 2 


the denominators cancel out the unwanted factor in 4R\2\2(EG — F 2 ). 
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SECOND PROOF (OUTLINE). Let (r,cp) be the coordinate system around p 
which is introduced on page 136. We know that in this coordinate system 

〈， )=dr ^ dr - G dcp ^ d<p 


for some function G, and (see page 145) that 

3 3 VG 


K(p) 


3r 3 


(Pi 


Introduce a Riemannian normal coordinate system x l ,x 2 by the equations 

x x = r cos (p, x 2 = r sin^?. 

We can then calculate the gij in terms of G, and use these results to show that 
the quantity 


Q 


d 


dx l 


3 


P 


， dx 2 


2c n> 22(p) 


P 


is equal to —K(p)/3. The result then follows from Proposition 6 . <♦ 


8 . PROPOSITION. Let (Af, { , )) be a Riemannian manifold, and let W be 
a 2-dimensional subspace of M p , spanned by X,Y e M p . Let 0 C VF be a 
neighborhood of 0 g M p on which exp is a diffeomorphism, let /: exp(0)-> 
M be the inclusion, and let R be the Riemann curvature tensor for exp(0) with 
the induced Riemannian metric /*〈，》• Then 

(R{X,Y)Y,X) = {R{X,Y)Y,X). 


Consequently, 


(R(X,Y)Y,X) 

urn 2 — 


is the Gaussian curvature at p of the surface exp(0). 


FIRST PROOF. It obviously suffices to prove the theorem when X and Y 
are orthonormal. Choose a Riemannian normal coordinate system at p with 
X = d/dx l |p, Y = 3/3x 2 |p ； then -Y 1 ,^ 2 is a coordinate system on exp(0). 
Now we are trying to prove that ^ 1212 (^) = R\ 2 \i{p)> But in (***)，the terms 
involving Christoffel symbols vanish at p. The theorem is now obvious, since 
the functions gij (/, 7 = 1 , 2 ) defining the metric / *{ , ) are just the correspond¬ 
ing gij restricted to exp( 0 ), and they have the same mixed partial derivatives 
with respect to a 1 and .y 2 . 

SECOND PROOF. It is even more obvious that the quadratic form Q associated 
with (exp(0),/*{ , )) is the restriction to W of the quadratic form Q on M p , 
for they are the second non-zero terms in the Taylor expansion of the same 
metric. 
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9. COROLLARY. Let (M， 〈， )) be a Riemannian manifold, let X,Y € M p 
span a 2-dimensional subspace W of M p , and let 0 C VK be a neighborhood 
of 0 on which exp is a diffeomorphism. If Q is the quadratic form on M 
defined previously, then 

-3Q(X,Y) = (R(X, Y)Y,X) 
hup — ^i^,np — ? 

where K is the Gaussian curvature at p of the surface exp(0). 

The quantity 〈及 (A" ， Y)Y, X)/\\X, Y\\ 2 appearing in Corollary 9 is called the 
sectional curvature K(W) of W. It would seem that the function (X, F) 

〈穴 (A" ， Y)Y, X) contains only a small portion of the total information contained 
in the curvature tensor, but Propositions 10 and 12, which follow, show that R 
satisfies certain identities which allow it to be determined in terms of the metric 
(,)and the quadratic function Q which it determines. 

10. PROPOSITION. The curvature tensor satisfies the following identities: 

(1) R(X ， Y)Z = -R(Y, X)Z, hence 

{R(X, Y)Z,W) = -(R(Y, X)Z,W). 

(2) {R(X, Y)Z, W) = -{R(X, Y)W, Z>. 

(3) R(X ， Y)Z + R(Y, Z)X + R(Z, X)Y = 0, hence 

(R(X, Y)Z,W) + (R(Y, Z)X, W) + {R(Z, X)Y,W) = 0, 

(4) (R(X, F)Z, W) = (R(Z, W)X, Y). 

PROOF. In a coordinate system x, these relations are equivalent to 

⑴ ^ jkl = ~^jlk = — Rijlfc 

( 2 ) Rijki = -Rjiki 

(3) k jkJ + 兄 /c" + ^ ljk = 0 。犷 Rijki + Rik" + ^iljk — 0 

( 4 ) Rkiij = Rijki • 

These are immediate from (**) and (***).♦:♦ 

Notice that {R(X, F)Z, W) is skew-symmetric in both (X, F) and (Z, W), 
which again shows that {R(X, Y)Y, X) changes by det(a") 2 when X and Y are 
replaced hy a\\X + a 2 \Y,a 2 \X + ajiY. For later use, we insert a result which 
shows that the fourth property of 穴 is a formal consequence of the others. 
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11. PROPOSITION. Let K be a vector space and (R:VxVxVxV-^R 
a multilinear map satisfying 

( 1 ) iR(X,Y,Z,W) = (R(Y,X,Z,W) 

( 2 ) (R(x, r,z,w) = (R(x, y, w,z) 

(3) (R(X, r,Z,W)-h (R(V,Z,X, W) + (R(Z,^, Y, W)^ 0 . 

Then (R also satsifies 

(4) (R(^r, y,Z, W) = (R(Z, W,X,Y). 

PROOF. The proof is a tricky manipulation, cleverly systematized by the fol¬ 
lowing diagram from Milnor’s Morse Theory. 



(R(Z, W,X,Y) 


Equation (3) shows that the sum of the numbers at the vertices of triangle W 
is zero. The sums of the vertices of triangles X, Y, and Z are also seen to be 
zero, using (1) and (2). Adding these identities for the top two triangles, and 
subtracting the identities for the bottom ones, we see that twice the top vertex 
minus twice the bottom vertex is zero. 

12. PROPOSITION. Let K be a vector space and (R,: V xV xV xV IRtwo 
multilinear maps satisfying (1)-(4) of Proposition 11. Suppose (Rj (X, Y, X, Y)= 
(R 2 (m F) for all X,Y eV. Then (R t = (R 2 . 

PROOF. It clearly suffices to prove that a multilinear (R satisfying (1)—(4) is 0 if 
(R(A"，H F) = 0 for all X, Y g V. Now we have 

0^(R(X,Y + W, X,Y + W) 

= (R(X, Y,X,Y) + (R(X,Y,X, W) + (R(X, W,X,Y) + (R(X, W, X, W) 

= (R(^, F, X, W) + (R(X, W, X,Y) 

= 2(R(X, V,X, W). 
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Using (1) and (2)，we easily see that (R is alternating, and hence skew-symmetric. 
Consequently, (3) gives 


Propositions 10 and 12 tell us that the curvature tensor R is completely de¬ 
termined by the values of {R(X, A"), and hence by the quadratic func¬ 

tion Q. [This means that in a sense we can frame a coordinate-free definition 
of the curvature tensor, but it would certainly be an awkward one. Moreover, 
given a multilinear map (R:KxKxKxK M, satisfying (1)—(4), it is a 
fairly difficult exercise to work out a formula for (R in terms of the quanti¬ 
ties (R(X, Y, X, Y).] In terms of a coordinate system, we see that the tensor 
(R(X, Y, Z, W) = (R(X ， F)Z, W) is determined by the components Rijij, of 
which there are with i < j. According to Riemann, these func¬ 

tions must determine the metric completely; in other words, the tensor (R must 
determine the metric.* 

Recall that we have selected one special case of this assertion as our Test 
Case, which can now be restated as follows: If 7? = 0, then the manifold is flat. 
We are ready to present the first, and longest, of our proofs of the Test Case. 
It is separated into three Steps, and all our subsequent proofs, no matter how 
elegant and brief, essentially contain these same three Steps. 

Recall that for a coordinate system y l ,..., we have the formula (pg. 1.331) 

(*) ^ = UkJ] + [jkJl 

which is equivalent to the definition of the Christoffel symbols, as well as the 
formula (pg. 1.331) 


氺氺 


h lj 

dv k 


n 




which can be derived from it. 


* This is not reallv the same as saying that R determines the metric, since we can’t 
determine R(X, Y)Z from (R(X,Y,Z,W) = (R(X, Y)Z,W) unless the metric is al¬ 
ready known! In fact, any numbers Rijki satisfying the identities of Proposition 6 can 
be realized as the components, at a point, of R for some metric (the Rijki determine 
the second derivatives of the metric at the point). 
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13. THEOREM (THE TEST CASE ； FIRST VERSION). Let (M， 〈， 〉）be 
an n-dimensional Riemannian manifold for which the curvature tensor R is 0. 
Then M is locally isometric to with its usual Riemannian metric. 

PROOF. This is a purely local question, so we assume that M is with the 
standard coordinate system j 1 ,..., y n , and the Riemannian metric 

n 

(,)=XI gi J dyl 公 d y j • 

U = \ 


Step 1. We claim that there are functions (h\ ， … ， h n ), with any desired initial 
conditions (/ii (0),..., /i w (0)), satisfying the equations 

W J^ = i2 r Jk h y 

The reason for this is, of course, that the relations R y jik = which express the 
vanishing of R, are just the integrability conditions for (*), as we have already 
seen. 

In particular, for a = 1,... ,n we can choose such a set (h ⑻ i ， … , h^ a \) 
satisfying the initial condition 

(/z ⑷ “0) ， … ， /z (a) „(0)) o = 4 


where X\,.. ,,X n g M w o is orthonormal with respect to { ， 〉 o. 

Step 2. We claim that if (/ii，• • •, h n ) satisfies (*)，then h = dx for some func¬ 
tion x, i.e., hj = dx/dy J \ In terms of the form 

rj = hi dy 1 + … + h n dy n . 


we are just saying that rj is exact. We know (Corollary 1.7-15) that this is true if 
and only if 


dhj dhk 
dy k dyj 

Glancing at (*)，we see that this is indeed true, since T】 k = 


r :r 


Now choose functions x a with h( a 、j = dx a /dy J \ Then the functions x a 


satisfy 

(t) 


d 2 x a 

dy^dy k 


n 


Er. 


Y 


3.v a 


r- 


jk dy 


y 


[these are the equations (6 )， 
obtained earlier, page 187] 
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and 


dx a 


r ⑼ 


a 火 


this matrix is non-singular, so x 1 ， … ， X 〗 
hood of 0. 






is a coordinate system in a neighbor- 


Step 3, We claim that x is the desired coordinate system, i.e.，that 


n 


(I) 


^ixv = g 


:; dx^ dx v 


u 




3/3〆 


[equation (4’)，page 185] 


We know that this equation holds at 0, by the choice of the initial conditions 
dx a /dy J (0). So it suffices to show that the right side of (J) has all partial deriva¬ 
tives d/dy^ equal to 0. But 


3 




k 




g 


dx^ dx 


ay 


dx v \ ^ 

W) = Ti 


dg^ dx^ dx v 
^ dy^ dy^ dy J 


E 发 




d 2 x fM dx v 


dy l dy^ dy^ 


E〆 7 


:: dx^ d 2 x } 


dy l dy^ dy k 


E 


dx^ dx V ▽ u y-y dx^ dx V 


~ dy^ dy l dyj 

E ， E r 


Y 


dx^ dx v 




y 


3^ 7 ' dy 


3y y dyJ 


by ⑴. 


Switching some indices, we thus have 




g 


" dx v 
dy l dyj 


) = hw 


dx^ dx v (dg" 
dyj \ dy k 




yk 


y 


Y 


0 by (**). 


This completes the proof of the theorem. ♦ 


As a brief review of the proof，we note that 


Step 1 uses the integrability conditions, R = 0, to obtain certain forms 
X!/ dy l , with any desired initial conditions; 
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Step 2 uses symmetry of the Christoffel symbols，rfj = to prove that 
Yli dy f — dx a for some 

Step 3 uses the definition of the Christoffel symbols [ij^k\ to prove that the 
vectors d/dx a are orthonormal. 

Despite its length, the proof is essentially a straightforward application of the 
integrability conditions for partial differential equations. As Riemann says, at 
the end of the section in Part C, “Given an acquaintance with the traditional 
methods, it is demonstrated without difficulty that these ... conditions, when 
they are satisfied, suffice.” 

We have thus proved one special case of Riemann’s assertion that the curva¬ 
ture determines the metric. We will not return to the more general assertion 
until Chapter 7, for our immediate task will be to begin systematizing all the 
results which have been uncovered so far. 
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ADDENDUM 

FINSLER METRICS 


A Minkowski metric on a vector space K is a function F: V ^ R such that 

尸⑻ > 0 for all i; 0 

Clearly F is completely determined by its “unit sphere” {u : 尸 ( 1 ；) = 1}; the unit 
sphere is symmetric with respect to 0 G V", and intersects every ray through 0 
exactly once. Moreover, any such set is clearly the unit sphere for some F. The 




function F is never C°° at 0, but F 2 may be, in which case we will simply say 
that F is C°°. The general metric which Riemann mentions is essentially an 
assignment of a C°° Minkowski metric F p to each tangent space M p , in such a 
way that F p varies smoothly with p. We will call such an assignment simply a 
“metric” on M. 

If c: [a,b] M is a curve in a manifold M with such a metric, then we can 
define the length of c to be 




F c{t) (c\t))dt. 


a 


If p: [a,b] [a, b] is an increasing diffeomorphism, and we denote 

by g ⑴， then 

length of c o /? = f F c{p{t)) ((c o p)\t))dt 


a 


>b 


Fc{p{t))ip\t) ^c\p(t))dt 


a 


P~Hb) 




p\t)g{p{t))dt 


>b 


g(t) dt = length of c. 


a 
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The same result clearly holds if p is decreasing; thus, the length of a curve is 
independent of parameterization. It is to insure this result that we require our 
metric to satisfy F(Au) = |A| - F(v). 

Although a C°° metric F on a manifold M is not a tensor, it can be used 
to construct a tensor on the manifold TM. To do this, we first consider a C°° 
function /: V ® on a vector space V. For any two vectors v,weV we can 
form the second derivative 




d 2 

di^ 


t~o 


this is a sort of second order directional derivative at v. If ,..., is a basis 
for V, and 0: M is defined by 



then 


■^( 自 〜) (E c，i) -) = E Sb {b)cic] - 

The map /**(u): K R is called the Hessian of / at u G K When F is a 
Minkowski metric, it is clear from the definition that 


(F 2 )** ⑻ (u)=2 剛 ] 2 . 

[This Hessian may be compared with the Hessian /** defined in Prob¬ 
lem 1.5-17 for a function /: A/ M, at a point p e M where f^ p = 0. The 
latter is a bilinear function on M p , whereas the present f^(v) is a quadratic 
function on V. The associated bilinear function is easily seen to be 


3 2 

(M2 卜 — 


/(v -h SW\ -h tw 2 ), 

( M )=0 


and in terms of a basis it is given by 



n 


C Vf 




d 2 <j> 


dx^xj 


rb - c l 

f " 


the same formula which occurs in Problem 1.5-17. Our Hessian is defined 
even at points where # 0 because we are working with a vector space, and 
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identifying it with its tangent space at v\ this amounts to saying that wc are 
considering only linear changes of coordinates, all of which leave the quantity 
defined by this formula invariant.] 

Now if M is a manifold, with a C°° metric F, and v e M p , then the tan¬ 
gent space (TM) V of TM at v can be identified with M p \ in accordance with 
this identification, we denote a vector in (TM) V by w v . We can now define a 
tensor ^ on TM by 

If F is the norm || || associated with a Riemannian metric ( , ), then it is easy 
to see that 

尹 — 切 ) p ， 

and in general we always have 

[F(v p )] 2 ^J^(v v ). 


If x is a coordinate system on M, and (x o 丌， x) is the corresponding coordinate 
system on TM (defined on pg. 1.81), then 


where 


3^ = 〉: gij dx^ - dx J 


、 1 d 2 (F p 2 ) 

Siji^p) = - u u ( 々 ). 


2 dx ( dxJ 

Classically, one dealt only with the functions g/y, defined by this formula, and 
checked that the function 



幺 " W 

/,；=i 


was independent of the coordinate system x. 

A Finsler metric is defined to be a metric F such that the quadratic function 
F : (TM) V M is positive definite for all v e TM. In terms of a coordinate 
system x, this means that 


(Sii (U/0) 


d 2 (F P 2 ) 


2 \ dx l dxj 


r( v p) 


is a positive definite matrix. 


To interpret this condition geometrically, we consider once again a function 
f : V M. Suppose that for all v we have /**(u)(u;) > 0 for all w ^ 0. 
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Then for all v, and each straight line through v, the function / obtained by 
restricting / to this line has an everywhere positive second derivative. By a 



standard theorem {Calculus y pg. 220) this means that the function / is convex 
(the set of points above or on its graph is a convex subset of the plane). It is easy 
to see that consequently the function / itself must be convex (the set of points 
on or above its graph is a convex subset of K x R). Conversely，convexity of / 
implies that /**(v)(u;) > 0. Our stronger condition /**(u)(u;) > 0 might be 
called “very-strict-convexity of f”. 

When we apply this to a Finsler metric, we see that each function F p 2 is 
convex on M p , so that each C p — {(v,r) e M p x M : r > F p 2 {v)} is a convex 
subset of M p x M. It is easy to see that this is equivalent to convexity of the 
“unit ball” {w : F p (w) < 1}, which may be regarded as the intersection of C p 
and the hyperplane r = 1 in M p x R. Note that convexity of the unit ball is 
equivalent to the “triangle inequality” 

F p (w\ + w 2 ) < F p (w\) + F p (w 2 ). 

In general，a function || || : V —> M on a finite dimensional space V is called a 

Banach space norm if 

||u|| >0 for i; # 0 

|| 入 HI = W • IMI 

I|u + u;|i < \\v\\ + ||u;|| 

(if V is infinite dimensional, the definition is more involved). So a Finsler metric 
on Af is a C°° Banach space norm on each M p , varying smoothly with p (and 
with the unit balls on each M p satisfying a very-strict-convexity requirement). 
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Although we will not develop the theory of Finsler，or more general，metrics 
here，we will mention a few facts. In the case of a Finsler metric, since the 
matrix (^/y(^)) is non-degenerate, we can define g 1 ^ (v p ) so that 

n 

( v p) • Sjk( v p) — 

7 = 1 

The reader may seek an invariant description of the g” (v p ). We can also define 
the symbols 

r .. , n 1 (^gik , dg Jk dgij \ 

/ =i 

as before, except that they are now functions on TM. It turns out that the 
critical paths for length are curves c : [a 7 b] M which, when parameterized 
by arclength, satisfy 

n 

+ E 咖 ⑹) • 

乙 /=1 

It also turns out, just as in the Riemannian case, that sufficiently small pieces of 
these critical paths are actually paths of shortest length. However, this is false 
for more general Minkowski metrics. 

We shall not pursue the subject of Finsler metrics much further, but we 
will add some remarks about Minkowski metrics F on a vector space V. A 
Minkowski metric F can be used to define a “distance” function on K x K, 
by (u, w) i-^ F(w — v) (however, it is easy to see that this distance function 
satisfies the triangle inequality, and is consequently a metric, if and only if F is 
a Banach space norm on V). This is just the procedure by which, in analytic 
geometry, we define the distance between two points in W 1 ; in this case, we 
choose F(x) = (XX-^O 2 ) 1 ^ 2 ? motivated, of course, by the Pythagorean Theo¬ 
rem. After the Pythagorean Theorem has been incorporated into our definition 
of distance in this way, it is interesting to ask what content, if any，remains to 
this theorem. The answer is, that the Pythagorean Theorem has been declared 
true only for right triangles with sides parallel to the axes，but remains true for 
all right triangles. This is because, when F(x) = (XX-^O 2 ) 1 ^ 2 ? the isometries 
of R n are transitive on the unit sphere. That is to say, if p and q are in the unit 


W(cCy)) dx } {c{s)) 


ds 


ds 


0 . 


d 2 x k {c{s)) 

ds 2 
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sphere, then there is a linear transformation 0: ^ with <p(p) = q and 

F(4>(x)) = F(x) for all x. 

This same transitivity property holds for any F which is the norm || || as¬ 
sociated to a positive definite inner product ( , > on an /z-dimensional vector 
space V, since there is an isomorphism /: U n V such that / *( , ) is the 
usual inner product on R n (Theorem 1.9-3). It turns out that this property ac¬ 
tually characterizes the Minkowski metrics F which arise from inner products. 
To prove this, we need an auxiliary concept, and a result from linear algebra. 

An ellipsoid on a vector space K is a set of the form {v e V : (v, v) < 1} 
for some positive definite inner product ( , ). In particular, consider such an 
inner product ( , ) on which also has its standard inner product (,). 
The ellipsoid {v e V : (v, v) < 1} really looks like an ellipsoid, because of the 



following well-known result: 


14. PROPOSITION (EXISTENCE OF PRINCIPAL AXES). If ( , > is any 
positive definite inner product on then there is a basis for which is 
orthonormal for ( , ) and also orthogonal with respect to (,). 

PROOF. For each x e the map y {x, y) is a linear functional, so there 
is a unique Tx eW 1 such that 

(Tx,y) = (x,j> for all y. 

It is easy to see that x Tx is a linear transformation 7": —> M w . Moreover, 

(Tx,y) — (x,y) = (}\x) = (Ty\x) — {x, Ty), 

so T is self-adjoint with respect to {, > Thus T has a basis Xi,... ,x n of 
eigenvalues, TXt — X/X/, and the Xi can be picked orthogonal with respect 
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to ( , ). Now 


= 乂】 {TXi , Xj ) = A,/A,y (X/, Xj ) 

= Txj) 

— ki(Xi,Xj). 

So (x/, Xj) = 0 if 入 ★ Xj. On the other hand, if two or more x/ have the 
same 入 /， then in the m-dimensional sub space which they span we can pick m 
eigenvectors which are orthogonal with respect to ( , ). So we can assume that 
{xi, Xj) =0 for i ^ j. Now we just normalize each X/ with respect to ( , 

We now use this result to prove the basic lemma for our main assertion. 


15. LEMMA. Let 5 be a bounded neighborhood of 0 in an w-dimensional 
vector space V. Then among all ellipsoids containing B there exists a unique 
one of smallest volume. (We assign a volume to the ellipsoids by choosing an 
isomorphism of V with Choosing a different isomorphism clearly does not 
change the property of having the “smallest volume ”.） 

PROOF. First we prove existence. We might as well assume that V = 
Moreover, we can assume that B is closed, since an ellipsoid containing B also 

contains B. Choose r and R so that 

{x 6 : |x| < r} c B c {x eR n : |x| < R}. 

Every orthogonal basis 6 = (i ； i,..., u n ) of determines an ellipsoid E(b) with 
principal axes V\,... ,v n [equivalently, b determines an inner product on W 1 , 
namely the one which makes V\,. .. ,v n orthonormal]. To prove existence it obvi¬ 
ously suffices to consider only ellipsoids of volume < aR n , where a is the volume 
of the unit ball. Now the principal axes ,..., of any ellipsoid containing B 
must have lengths a,* > r. Consequently，if this ellipsoid has volume < aR n , so 
that Yii S R n , then each a/ < R n /r n ~ x . 

Consider the set {b = (i ； i,..., u n ) : B C E{b) and length Vj < R n /r n ~ x }. 
This is a compact subset of the w-fold product x • • • x Hence b i-^ 
volume E takes on its minimum on this set. This proves existence. 

Now consider two different ellipsoids containing B, with the same volume. 
Choose an isomorphism of V with which makes the first of these ellipsoids 
correspond to the ordinary unit ball {x 6 : f\(x) < 1}, where f\(x)= 

[(•X 1 ) 2 . Proposition 14 shows that after a rotation of the axes, the second of 
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the ellipsoids corresponds to {x eR n : / 2 (x) 5 1}, where 


/ 2 (x) = a\(x 1 ) 2 + • • • + a n (x n ) 2 . 



The volume of this ellipsoid is 1 1 times the volume of the unit ball. 

Since the two ellipsoids are assumed to have the same volume, this means that 


n 


]~[«/ = 1 . 


Now consider the ellipsoid 

E = 




Clearly E also contains B. Now the semi-axes of E have length 

1 


1 + «/ 


so the volume of E is the volume of the unit ball times 

n . 


n 


1 -\- at 
2 


Recall that for a , 办 > 0 we have 

a + b 


\fab 


< 


with equality if and only if a 


Consequently, 


n 


n 


n / … 






4 


n 


n 


aj 


and strict equality holds if some ai ^ 1, i.e., if the original two ellipsoids are 
different. This means that if two different ellipsoids containing B have the same 
volume, then there is another ellipsoid containing B with smaller volume. This 
clearly proves uniqueness of the ellipsoid with smallest volume. 
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16. THEOREM. Let 尸 ： K — R be a continuous Minkowski metric on an 
w-dimensional vector space V. Suppose that for all p and q in the unit sphere 
{v e V : F(x) = 1}，there is a linear transformation 少 ： V — V such that 
<j>{p) = q and F(<p(v)) = F(v) for all v e V. Then F is the norm determined 
by some positive definite inner product. 

PROOF. Let B = {v \ F{v) < 1} ? and let E be the unique ellipsoid contain¬ 
ing B of smallest volume. Clearly，there must be some point p with F{p) = 1 
and p G boundary E. Let q be any other point with F(q) = 1, and <j> \ V ^ V 



a linear transformation with 0(/?) = q such that F(4>(v)) = F(v) for all v e V. 
It follows easily from the latter property that ()>{E) D B. Moreover ， 4>(B) = B, 
so 0 is volume preserving. By uniqueness of the ellipsoid E, it follows that 
4>(E) = E. Consequently, q = 0(/?) e boundary E. In other words，every 
point q with F{q) = 1 is in boundary E. This means that E = B. ♦♦♦ 



CHAPTER 5 

THE ABSOLUTE DIFFERENTIAL 
CALCULUS (THE RICCI CALCULUS); 
OR, THE DEBAUCH OF INDICES 


A lthough Riemann essentially defined the curvature tensor, the (classical) 
notion of a tensor did not even exist in his time. The development of the 
“Calculus of Tensors” is due mainly to Ricci, and was carried out in the years 
1887-1896; in 1901 he and his student Levi-Civita gave a detailed description 
in a memoir Methods de calcul differential absolu et leurs applications. In addition to 
a comprehensive use of tensors, what distinguished the Absolute Differential 
Calculus, and gave it its name, was an important construction which greatly 
simplified all the concepts of Riemannian geometry, especially the curvature 
tensor. Instead of checking that the horrible formula used to define R l jkl trans¬ 
forms correctly, we are going to check that an equally mysterious 一 but not quite 
so horrible — formula transforms correctly, and thus defines a tensor; then we 
will define the curvature tensor in terms of this one. 

In 1869, in one of the earliest papers which took up Riemann’s ideas, Christof- 
fel had already made the observation which Ricci made the basis for his calculus. 
Suppose that F is a vector field, and 

Y = j2\ ] ~ 

^ dxJ 

^ dx /J 
7=1 


Consider the following symbols，defined in terms of the Christoffel symbols of 
the second kind for the coordinate systems x and x f : 


k J 


dV 


\h 


dx h 


+ EW 



d 以 

dx fh 
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It is easy to check that for these symbols — the sums of partial derivatives of the 
A 7 and certain linear combinations of them — we have 

\j dxh 3x，a 

hj = l 


dx’P dx J 


For the calculation we use the formula on page 189 together with the formula 


n 


入 


tOL 


iy 


dx fa 


when we compute dk ,a /dx f ^, the extra terms involving second partial deriva¬ 
tives of the x’ a just cancel out with the extra terms in the transformation formula 
for the Christoffel symbols. This calculation shows that there is a certain tensor 
field of type (J) which equals 


n 


A y ; /, dx h (g) 

hj=\ 


dxj 


in the x coordinate system. In classical terms, u if the A 7 transform like a tensor 
of type (?), then the A 7 ；/, transform like a tensor of type (I)' Similarly, if the A/ 
transform like a tensor of type Q, then the quantities 

k i,h = 告 - 史、 


dx h 


Vi 


are easily seen to transform like a tensor of type (^), so that there is a tensor of 
type (q) which equals 


n 


ki-h dx l (g) dx l 

i^h~\ 


in the x coordinate system. Generally, given a tensor of type (^), with compo¬ 
nents 


A 


J\ … J! 


/ J - . 


there is a new tensor, of type with components 


A 


j \ … j i _ 






n 

ik 


n 


il ...ik U 


dx h 




y i' ■ ■ 一 i ^ Is +1 — ] i 




s=l 


n n 




I v ,r + l "- l k 
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The proof is again just an enormous calculation. At the other extreme from 
this most general case is the special case of a tensor field of type Q on M ， i.e .， 
a function /: M -> M. Here we simply define 

f 一 i 

so that in this case at least the identity of the tensor field 

h—\ h—\ 

is no mystery — it is just df. 

The tensor of type which we thus obtain from a tensor A of type (f) 

is called the covariant derivative of A, since it is covariant of one order greater 

than A. It is also called the ‘‘absolute derivative^ of A; here the word “absolute” 

means that it doesn’t depend on a particular coordinate system. Various nota- 

tions for the covariant derivative are encountered — one sometimes sees 入心 ， or 
• » 

or even k J ih . Use of a semi-colon should avoid all possible confusion! 

A partial answer to the question “What does the covariant derivative really 
mean?” is given by the following observation. 


1 . PROPOSITION. Let x be a Riemannian normal coordinate system at the 
point p e M, and A a tensor of type (f) on M，with 


• * o 

… dx h 0 … 0 dx ik 0 —r- 0 … 0 




Then the components at the point p of the covariant derivative of A are just the 
ordinary partial derivatives 


(*) 


A Jl •••】! 

ii …/ 人 ' 


(p )= 




n 

•4 


dx h 


(户） • 


PROOF. In a Riemannian normal coordinate system at p, the Christoffel sym¬ 
bols [ij,k\ and hence also the rg，are all 0 at 


Proposition 1 can even be used to define covariant derivatives. The Riemann¬ 
ian normal coordinate systems at p are a natural set of coordinate systems ， 
determined by the metric 〈，》； any two such systems at p differ only by an 
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element of O(w), and it is easy to see from this that a definition of the covariant 
derivative by means of (*) would not depend on which one was picked. With a 
little work, one could then deduce the general expression for the components of 
the covariant derivative of A in any coordinate system. In succeeding chapters 
we will be giving still other interpretations of the covariant derivative. 

The operation of covariant differentiation obeys many rules analogous to 
those for ordinary differentiation. 


2. PROPOSITION. Covariant differentiation is a derivation and commutes 
with sums and contractions. For example, if A and B are tensors of type (】) 
and C is a tensor of type (J), then 

(Al^ 2 C^) u = A J . l i\C. h + A j .^ 2 C. j \ 

V IU2 li I\i2\h^n ^ I\i2 iy,h 






V ： 


Consequently, we have，for example, 


(s 


\ n 

Ai\h r v \ _ Ahh pv , A j\]i r v 

A vi 2 I - A vi 2 \h^h 十 C /3 ； A* 

厂 ， h v = l 


PROOF. Compute. Because of Proposition 1, the computations become trivial 
if one uses Riemannian normal coordinates. ♦ 


There is one tensor which we have on any manifold, the identity map of 
each M p into itself, with coordinates in any coordinate system. For the 
covariant derivative we have 








which shouldn’t be very surprising (what else could it be?). Aside from these 
general formulas, there are two of crucial importance. The first of them is about 
the covariant derivative of the tensors ( , > and ( ， ）*. 
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3. PROPOSITION (RICCI’S LEMMA). The gij and g 1 ^ behave like con' 
stants in covariant differentiation; that is ， 

Sij\k — ^ 
g lJ ；k = 0. 

PROOF. The proof is, of course, a calculation. For the g" we have 

gij;k = ^ - 

v=l v=l 

= 0， 

by equation (*) on page 196. Similarly, the second equation is equivalent to 
the following equation (**). It can also be obtained from the first equation and 

Proposition 2, since Ylj Sjl = and= ().♦:♦ 


The second crucial formula involves “second order” covariant derivatives. 
If ^4 is a tensor of type (,)，with components 々 . 乂 ’， then the operation of 
covariant differentiation can be applied to the tensor B with components 

… ji _ aJi-- h 

D i\...i^h _ 八 / 


there results a tensor C with components 

广…) / —只力…力 一 f jji … h 、 


These components are denoted by 


々 … ji 


or simply 


…力 


For example, if we start with a function / [a tensor of type Q], then 


/;/ = 


¥_ 

a? 


and 


f\ij 



-E 


3x v 




a 2 / ¥ r , 

3x / 3a^ 乙 3x v 力 
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Notice that 


fuj — f'Ji 


by symmetry of the r^.. The same result definitely does not hold for other 
tensors; instead we have the following basic result. 

4. PROPOSITION (RICCPS IDENTITIES). If the ^ and A/ are compo¬ 
nents of tensors of type (?) and Q, respectively, then 


n 

^ l ;jk ^ \kj = — V R l Ijk 

l-\ 


n 

^i\jk ~ ^i\kj — Rli jk ， 

/ = 1 


where . . 

1 

(There are similar identities for tensors of type (^), but we will ignore them.*) 
PROOF. Compute. 

[The second identity is a consequence of the first, for if we are given A/ and 
define 

n n 

^ = J^g ia ^ SO that h = ， 

a=\ a=l 

then 


_ = ( / ^ g/a 入 

V=1 

n 

= gia(^ a ；jk - ^ a ； kj) using Proposition 2 

a=\ 

* For those who cannot bear to be left in ignorance, the general Ricci identity is 



A … _ A Jy … Jl 


k 


n 


n 




J\ JfV 

irhf) 




Js—l v 7 j +1 ••• 7 / rJ s 
ik 


vhr) 


r=l v=l 


v=l 
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n n 

= - HgiaV R°Ujk 

Of = l / = 1 
n 

= -[k 1 Riijk 
l—\ 
n 

— > : V Rlijk 

/ =1 
n n 

= YlYl gioc ^ 1 RCXi J k 

/ = 1 Of = l 
n 

= YlKR'jk ， 

a=l 


(by definition of Rujk) 


by Proposition 4-10 


and similarly the first identity is a consequence of the second.] 


5. COROLLARY. The R l jki are the components of a tensor of type (j). 

PROOF. Let Z be a vector field, with 

JL B 
dx l 

1 = 1 


• _ 

and let A be the tensor field with components k l jk- The first equation in 
Proposition 4 shows that 


(*) 



n 




dx i 


P ki it R ^) 


This shows, in particular, that B(d/dx^ \ p , d/dx k 1^) does not depend on the 
vector field Z，but on the vector Z(p) alone. So if we define 


R(X, Y)Z = -B(X,Y), 
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then 及 is a tensor of type (j), and 


B 


dx^ 5 dx^ 


)- 


R 


a 


厂 3x 灸 


Z 


/ =i \ 


dxJ ? dx k J dx l 


Since this is true for all n-tuples {A/}，comparison with equation (*) shows that 

_ • 

the components of R are indeed R l "k. 


[Classically，this corollary would be deduced from the following general princi¬ 
ple (compare Problem 1.4-5 (i)): 

Suppose we are given a set of numbers T l 狀 for the coordinate sys- 
tern x, a set T fl ijk for the coordinate system x f , etc. Suppose also 
that 

^jk ~ C fl j k = etc .， 

/ —l / =i 

are the components of a tensor for all tensors of type (?) with com¬ 
ponents in the coordinate system x, and components k fi in the 
coordinate system x\ etc. Then T l ”k are the components of a tensor. 


The classical proof is by a calculation. We have 


\ 、 \ f8 l ~p!OL 

2 ^人 r 脚 

8=1 


C I0L — 
c 办一 


i ， j，k 


dx fa 


dx J dx k 
dx^Jx^y 


. / i 3x’ a 3x) 3x 灸 
" 灸 3x’ dx ’ 於 dx fy 

i 3x’ 3x /a dx^ dx^ 
乙 ljk ~dx^ dx j dx，3 dx f y' 


Since this is true for arbitrary 入 ， we can choose all k f8 but one equal to zero; 
this gives the desired transformation formulas.] 

Corollary 5 represents only one minor application of the Ricci identities. 
A more significant application is obtained when we consider a manifold with 
vanishing curvature tensor. In this case, we have 

^\jk — ^\kj 


^•i\jk — ^i\kj • 
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Thus, in manifolds with a vanishing curvature tensor (and only in such mani¬ 
folds) the order of covariant differentiation is immaterial, so that in this respect 
covariant differentiation behaves like ordinary partial differentiation * This en¬ 
ables us to give a more direct proof of 


6. THEOREM (THE TEST CASE; SECOND VERSION). Let (M, ( , » 
be an «-dimensional Riemannian manifold for which the curvature tensor R 
is 0. Then M is locally isometric to with its usual Riemannian metric. 

PROOF. As before, we assume we are in M”，and choose the standard coordi¬ 
nate system 少 1 ， • • • ，少 ” around 0. 

Step 1. We claim that we can find 1 -forms T] = dy l , with any desired initial 
value 0(0)，satisfying 


(*) 


A- / j y — 0 i 9 J = 1，•••，” • 


To prove this, we begin by finding 入 /( 少， 0, • • • ， 0)，with the prescribed value for 
y = 0, and such that 


(*0 


0 = 入 /;1( 少， 0, •••，()) 


3 入 /(>，(),•• •，()) 
dy l 


n 

— h ( 少 • ， o)r 〔 .( 少 , o, 

v~\ 

This just involves solving a set of ordinary differential equations, which is even 
linear, so that solutions exist for all y where the r’s are defined. 



* At first sight, it might appear that this should always be the case, since covariant 
differentiation is the same as ordinary partial differentiation in a Riemannian normal 
coordinate system around p. However, the relation X l j = dk l /dx 】 holds only at p, so 
generally ^ l jk(p) ^ d 2 k l /dx J dx k (p). 
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Next we find 入 i •(少 1 ，少， 0 , •••，())， with the initial values 入 ,( 少 i ，0, • • •, 0) just ob¬ 
tained, satisfying 

(* 2 ) o = … ， o) 

_ 3 入 〆 乃，少, o ，...， o ) 
dy 2 

n 

- 入 v (少 i ，少，0,…， o ) r 公(少!，少，0,…， 0). 



We continue in this way until we eventually obtain 入 i •(少 1 ， ..., y n ~\ , y) satisfying 

(*«) 0 = 入/;« (少 i ， • • • ，， 少) • 

We now claim that, in addition to the relation A/ : i (^,0,... ,0) = 0 given 
by (*i)，we actually have 


To see this, we first note that we have 


■ i ;21 = 入 


3^i;2 

dy 1 


n 


n 


- - 


v ： 


V ： 


Since 0 = 入 /; 2 ( 少 ’l, >% 0,..., 0) by (* 2 )，we obtain 


ft 


i\2\ ~ - > : 入 /;v 厂 12 at (少 1 ， J ’， 0, •..，())• 


v ： 


Since R =0, the Ricci identities then imply that 


n 


入 m2 = 入匕21 ~ 入1>厂匕 at (少 1 , j ， 0 , …，⑴， 
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i.e.，that 

^ ^ n n n 

— ^v；i r* 2 / — Ai^r^ at ( 乃， 0 , …， o ), 

H 1；Z fa J im J ^ 1 ^ 

V = 1 V = 1 V =1 

so that 

3 入 •• n 

[ 入 v;i 「 公 = 0 at (ji, J,0, … ， 0). 

Since we have the initial conditions 入以 （乃， 0,… ， 0) = 0， it is clear that the 
solution of this equation is just the desired one, 

Proceeding in the same way, we next obtain 
入 /;i Oi, J 2 , J,0,...,0) = A/. 2 (ji, j 2 , J,0, ...,0) = A/ ;3 (ji, J2, J,0, ...,0) =0, 
and, eventually, 

0 = = A / ； 2 = • •. = Xi ;n at (ji,..., j«-i, j). 


This completes the proof of the claim. 

For a = 1we now choose q( a )= 人 (a) / dy\ so that 

X„ = (A ⑹ K0) ，…， A( a U0)) o 


are orthonormal with respect to ( , )o. 

Step 2. We claim that if 入 = 0， then Di dy l = dx for some function x. 
This is because we have 



n 


- 


v: 


which shows that 

9 入 / dXj 

dp = W' 

Now choose functions x a with )S a \ = dx a /dy l . As before, the x a are a 
coordinate system in a neighborhood of 0. 
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Step 3. We claim that x is the desired coordinate system, Le.，that 


n 




/,y=i 


ij dx^ 1 dx^_ 
dy i dyj 


n 


Y, ，入 0 ^ (v V 


hj 


As before, we just have to prove that the right side of this equation has all partial 
derivatives d/dy k equal to 0. But 


a 


3y k 


(F 


入 (M), ‘ 入 (V) 


) 



g0'x^x (v) j 


；k 


lJ + ， 7 入 ⑻， )M 

by Proposition 2 

0 by Ricci’s Lemma and equations (*).♦:♦ 


Comparing this proof of the Test Case with the first proof, we see that 


Step 1 uses the conditions 及 = 0 to obtain the forms dy l satisfy¬ 

ing (*). Instead of appealing to Theorem 1.6-1, we essentially reprove 
this theorem; the Ricci identities make the proof almost as easy as the 
proof of Theorem 1.6-0, the only complication being that the “mixed 
covariant derivatives” 入 ，; 12 depend on all not just on ki；\. 


Step 2, precisely the same as before, uses symmetry of the Christoffel symbols, 
r U = 


Step 3 uses the definition of the Christoffel symbols [z j\k] to prove that the 
vectors d/dx a are orthonormal. The proof is simpler because some of 
the calculations have been absorbed into the proof of Proposition 2, 
while the calculations using (**) on page 196 have been incorporated 
into Ricci’s Lemma. 


The absolute differential calculus turned out to be so useful (for many other 
applications besides the one just given) that it was soon exploited in the way 
all successful mathematical theories are — it was generalized. Notice that the 
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possibility of defining covariant derivatives depends only on the equation 


(*) 


r， ^ 


YA 

i_，M 


dx l 3x y dx /y 


n 


dx ,ot dx f ^ dx k 


E 


d 2 x^ dx ，Y 




dx fa dx f ^ dx ^ 1 


it does not depend on the particular way that the rfj are defined in terms of 
the gij . This observation suggests that we focus our attention on the trans¬ 
formation law (*) itself. Quantities which transform in this way are classically 
called connections. More precisely: 


A (classical) connection on a manifold M is an assignment of /i 3 num¬ 
bers to each coordinate system, such that equation (*) holds between 
the /i 3 numbers r^. assigned to the coordinate system x and the /i 3 

numbers T’j assigned to the coordinate system x f . 


Although this definition is exceedingly unappealing, classically it was motivated 
in the following way. If / is a function, then the quantities df/dx 1 are the 
components of a tensor, but if k l are the components of a tensor, then the 
quantities dk l /dx J ' are not. If we attempt to construct a tensor by adding linear 
combinations of the 入 ’， thus obtaining 







we find that these quantities are the components of a tensor provided that the r^. 
transform according to (*). 

Once we are given a connection (whatever in the world this connection may 
mean), we can imitate most of the work already done in this chapter for the 
special case where the rfj are the Christoffel symbols, and in addition we can 
generalize other considerations from Riemannian geometry. What follows is a 
brief outline of this program. 

We note first that, in contrast to the special case where the are the 

Christoffel symbols, a general connection need not satisfy r^. = r^.. How¬ 
ever, it is easy to see that the quantities 

1 ij - 1 ij L ji 

are the components of a tensor T, the torsion tensor of the connection. (In the 
next chapter, we will see the reason for the term “connection”，but no one seems 
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to have a good explanation for the term “torsion” in this case.) A connection 
r^- is called symmetric if the torsion tensor is zero. In this case, Tjj = 0 for 

every coordinate system, so = r^- for every coordinate system. Conversely, 

if in a set of coordinate systems which cover M, then the connection 

is symmetric. 

The following result gives at least a little geometric significance to symmetry 
of a connection. 


7. PROPOSITION. The torsion tensor 7" of a connection satisfies T(p) = 0 
if and only if there is a coordinate system around p with 

r^(^) = 0 for all 

PROOF. If r&( ； 7) = 0, then r(;?) = 0, so T f ^ = 0 in any coordinate system x\ 
which means that r ’： = r^ a . 

Conversely, suppose that T^j{p) = r^.(^) for all i,j\k in a coordinate sys¬ 
tem x. Define x fk by 

f k (q) = [x k (q) ™ x k (p)] + ^ Tt( P )[Aq)- x^p)] • [x\q) - x\p)l 


Using r^o) = r^o) 


,we compute that 


dx ,k 

dx l 

dx fk 

dx l 


=^ E r f/ ⑻[乂- xi (P)] 
i—\ 

(P) = ^ - 


This shows that x f is a coordinate system in a neighborhood of p. Moreover, 


d 2 x fk 

dx k dx l 


(P) = 


Substituting into (*), we see that T’:〆;?）= 0♦ 

Proposition 7 becomes more significant when we introduce the concept of 
geodesics. In our treatment of Riemannian metrics, we defined geodesics as 
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critical points for the energy function. For a general connection rg ，we can 
simply define a geodesic as a path y satisfying 


d 2 y k 

dt 2 


HI 


O 


Y 


• • 
dy l dy J 

dt dt 


0； 


a calculation shows this condition does not depend on the coordinate system. 
The basic theorems on differential equations show that geodesics through p 
are uniquely determined by their tangent vectors y f (0) e M p \ we can thus 
define exp: M p M as before.* We can also introduce “Riemannian normal 
coordinates” at p\ we choose any basis X\,... ,X n {or M p ， define x - M 71 

by x(2Z?=i al ^i) = ( a 1 ， … ， a n )，and let x = ；( 。 exp -1 . As in Proposition 4-1， 
we note that the geodesic y k (t) = ^ k t satisfies 

n 

ij=^ 

so that 

n 

for all / 2 -tuples (f 1 ， .. •，^ n ). 

This implies that 

咖)+咖)= 0; 

if we also have T (p) = 0, then we deduce that Tfj{p) = 0. 

We define covariant derivatives of tensors and the curvature tensor R for a 
connection by the formulas on pages 210 and 214. Notice that Proposition 1 
holds for a symmetric connection, while Proposition 2 holds for any connection. 
Naturally, Proposition 3 has no analogue for general connections. For non-sym- 
metric connections, the comparison of mixed covariant derivatives becomes a 
little more complicated. Recall that for a function /: M —► M we have 

f — a 2 / A df 
乂 " _ dx^xJ jr 

v=\ 

When the connection is not symmetric, we can express f'” 一 fjt in terms of 
the torsion tensor, which is also involved in the new Ricci identities. 


* Many global results about geodesics for a Riemannian metric do not hold for more 
general connections ； see Chapter 8, Addendum 2. 



224 


Chapter 5 


8. PROPOSITION. For a connection r$_，with curvature tensor R and torsion 
tensor T i we have 


n 

hi - f\ii = 

V = 1 


dx v iJ 


n n 

^ l ；jk - ^ l ；kj = - R l i)k + 

l—\ / =1 

n n 

^•i;jk ~ ^i ;kj = A./ Rf ijk + ;l ^lk _ 

/ =1 / =1 

PROOF. Compute, compute. ♦:. 


(For all these relations it is, of course, extremely important that, in the defini¬ 
tions, care is given to the order of the subscripts in the r’s_) 

Finally，we wish to consider the properties of the curvature tensor which are 
given in Proposition 4-10. Two of these have no analogue for a general connec¬ 
tion, since they involve a metric, but there is an additional relation, involving 
the covariant derivative, which holds for any connection. 


9. PROPOSITION. The curvature tensor for any connection satisfies the fol¬ 
lowing identities: 

( 1 ) R l jki = -R l jik 

(2) (Bianchi’s first identity) 

jkl + klj + R l Ijk 

n 

=(‘ + ‘ + ‘) + T u T lj + T u T U) 

m=i 

(3) (Bianchi’s second identity) 

ijk\l + ikl\j + R h ilj;k) 

n 

R h i^) — 0 . 
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In particular，if the connection is symmetric，then we have the much simpler 
relations 

(20 + + RUjk = o 

(3’ ） R h ijk\l + R h ikl\j + R h ilj\k = 0. 

Classically ，（ 3’）alone is known as “Bianchi’s identity”. 

PROOF. Equation (1) follows immediately from the definition. In the case of a 
symmetric connection, equation (2’）is also easy to verify (we have already done 
it in Proposition 4-10). It is even simpler to verify if we use Riemannian normal 
coordinates at p; in this case the definition (page 214) gives 




dx k 


dx l 


which yields (2’）at once. Using Proposition 1， for a symmetric connection，we 
also obtain 

which gives (3’). 

The proof of (2) and (3) in general is considerably more complicated. To 
begin with, notice that T 1 ^ = — 从 ， so 

E C E + E UL. 








We also have 


T kl 


dx l 


2^ l in 2 jk 一 ~ 2^ 1 ik 1 jh 








From these equations we see that the right side of (2) equals 





//I 




the two terms obtained by 
cyclically permuting j^kj. 


Using the definition of this is easily seen to equal the left side of (2). 
To prove (3)，we note first that (1) gives 

E T rk Rh w = E T jk Rh ^ + r W 

JLt IX 
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We also have 


R h ri 


dR h 


ijk\l 


ijk 


dx l 


e Rh _u Rh ^ 






At 


At 


From these equations we see that the left side of (3) equals 






the two terms obtained by 
cyclically permuting 


Plugging back into the definition, we find that this is zero♦ 


Believe it or not, the Bianchi identity will be useful later on, and crucial 
in the last chapter of Volume V. Even more surprising, in Chapter 7 we will 
present a derivation of the Bianchi identity which will make it seem like a natural 
result. With the present proof of the Bianchi identity we end our summary 
of the classical theory of connections. The presentation was made mainly as 
background for the succeeding chapters, in which the same results will begin to 
take on a more modern appearance. 



CHAPTER 6 

THE V OPERATOR 


f I 1 he contents of this chapter really differ very little from those of the previous 
丄 one，but everything will look quite different. The clean modern symbol¬ 
ism which gets introduced here is abandoned only in those parts of the chapter 
which compare the present treatment with that given previously. This refur¬ 
bishment of the classical theory, due to Koszul, is effected by singling out for 
invariant treatment just one of the concepts introduced previously, and then 
defining the other concepts in terms of it. We will begin with a definition，and 
then compare it to the classical one. 

A (Koszul) connection on a C°° manifold M is a function V (read 
“dell”) which associates a C°° vector field to any two C°° vector 
fields X and F，and which satisfies 

( 1 ) V x ^ X2 Y = V Xl Y^V X2 Y 

( 2 ) yx(Yi^Y 2 ) = V x Y^V x Y 2 
⑶ y f xY = f V x Y 

(4) V x (fY) = f-V x Y^X(f)^Y. 

Notice that V is assumed linear over the C°° Junctions in the argument X. By our 
standard theorem (1.4-2) this means that for any given vector field Y we can 
define 

Vx p YeM p 

for every X p e M p so that 

= p ^ ^x P 

Alternatively, one can define a Koszul connection to be a map V which assigns 
a vector V；^ F e Mp to every X p e M p and every C°° vector field F，and which 
satisfies 

(1，）W = v 々 r + v；^r 

(20 y Xp (X x + y 2 ) = v Xp Yi^v Xp r 2 
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(3 ，） ^aX p y = ci^XpY for all a G M 

(40 (fY) = f(p) .v^r + X p [f).Y p 

(S 7 ) If X and Y are C°° vector fields，then so is ^ ^x p Y- 

We then define by Vx Y(p) = Vx p Y. 

As one example of a Koszul connection, we take M to be M w and let Vx p Y 
be the directional derivative of Y in the direction X p (computed by taking 
the directional derivative of the component functions of Y). It is clear that 
properties (1’) 一 (5’) hold for this V. However, the most important justification 
for the particular conditions required of a Koszul connection comes from a 
comparison with classical connections. If x 1 ,..., is a coordinate system 
on M，and we define by 

(*) ▽上 ‘ 

dx l 




k=z\ 


J dx k ' 


then from (1)—(4) it is an easy exercise to deduce that rg are the components of 
a (classical) connection; conversely, given a classical connection, we can use (*) 
and (1)-(4) to determine a well-defined V. In the coordinate system x, if 


X = 




then 


^xY 



dx k 


For a given vector field Y we have a tensor VF of type (j), that is, a collection 
of linear transformations VK(^) : M p M p , given by 

VY( P )(X p ) = V Xp Y (=V x Y(p)). 


Clearly 


we can also write this as 



dx k 


VY = 


n 






i 9 k=l 


dx k 
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which shows that VF is the tensor classically described in terms of its compo¬ 
nents 入 V 

The covariant derivatives of all other tensors are now going to be defined 
in terms of this covariant derivative, which we have made the cornerstone of 
our new definition of a connection (and not merely in terms of the r 心 which 
it determines). There are two completely different ways of doing this, each of 
which has its advantages. The first way is purely formal: 

1. PROPOSITION. Let X be a C°° vector field on a C 00 manifold M with a 
Koszul connection V. Then there is a unique operator 

A \-^ Vj^^4 

from C°° tensor fields to C°° tensor fields, preserving the type (^), such that 

(1) V x f = X(f) 

(2) Vj^y is the vector field given by the connection V 

(3) A is linear over R 

(4) Vx (^4 ® B) = Vj^^4 ® 5 + ^4 ® Vj^ 5 

(5) For any contraction C, we have Vj^ o C = C o Vj^. 

Moreover，each Vj^^4 is linear over the C°° Junctions in the argument X, so for 
every tensor field A of type (，）and every X p e M p we can define 

^x p Ae T t k {M p ) 

with 

^x p ^x^ A = ^X P A + 

^aX p A = aV Xp A. 

PROOF. Essentially, this is Problem 1.5-15. If we define 

Df 二 X(f) 

DY = V X Y, 

then D does satisfy the condition 

D(fY) = fDY + Df Y i.e” D{f ®Y) = f ® DY + Df ®Y 

which is assumed for this problem. Briefly，the proof that D = V can be 
extended uniquely is as follows. For a 1-form a> we want 

X{a){Y)) = D{a){Y)) = (contraction of ® K) 

=contraction of [Deo ® Y a) ® DY] 

= Do){Y) + a){DY) 

=D(d{Y)+(d{V x Y\ 
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so we want 

(V X (o)(Y) = Da)(Y) = X(a)(Y)) - a)(V x Y) for all Y. 

Since any ^4 is a sum of functions times tensor products of vector fields and 
1 -forms, conditions (3) and (4) determine VxA. 

Following through the proof in detail, it is easily checked that Vj^^4 is linear 
over the C°° functions in the argument X. 


In view of Proposition 1, for any tensor A of type (f) we can define a new 
tensor VA of type (众 j* 1 ) by 

VA{p){X \ p ,..., Xk p , X p ) = Vx p ^{^\p, - • • ,Xk p ). 

a) = kj dx^ is a tensor of type ( 上 )， and we set 

n 

V 3 = 〉 ： a/ dx\ 

l—\ 

then we have 


= V 3 ( contraction of ⑴ 0 —— r 

dxt W 


contraction of 


Z r t^ 


▽ a ⑴⑭ tt + v a tt 


so we obtain 


d7 


k=\ V /x=l 

n 

〉 : ^-k\i dx^. 

k — \ 


We can also write this as 


Vdt> ― 〉: dx^ 0 dx l 


i 5 /c = l 
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which shows that ▽⑴ is the tensor classically described in terms of its compo¬ 
nents Ajt；/. Similarly，we easily see that if 


A = dxh ® … ® 办 , 、 ® 577 ® …公 


dxJi 


then 


VA = dx l{ ® ® dx lk ® dx h ® - ~~ - ® … ® 

… 1 k.M h y7i 


dxh - v dxJl * 

The uniqueness clause in Proposition 1 is precisely what accounts for its use¬ 
fulness: all properties of V^l should be derivable from properties (1)—(5)，since 
these properties characterize V^l. On the other hand, the proposition gives no 
idea what is going on geometrically. To obtain such a picture we introduce 
another extremely important concept. 

First consider a curve c : [a^b] M. By a vector field V along c we mean a 
function V on [a,b] with V t = V(t) e ⑴ • In a coordinate system (x, U) we 



_ • _ 

We call V a C°° vector field along c if the functions v l are C°° on [a,b]; this is 

equivalent to saying that t \-^ Vt(f) is C°° for every C°° function / on M. 
Now suppose that K is a C°° vector field on a neighborhood of c([a, /?]). Then 
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is a C°° vector field along c. This vector field is called the covariant derivative 
of V along c; we will denote it by the convenient symbolism 

DV 

~dt' 

which involves all the classical ambiguities. We would like to generalize this 
covariant derivative along c to vector fields V which are themselves defined 
only along c. 


2. PROPOSITION. There is precisely one operation V \-^ DV/dt, from C°° 
vector fields V along c to C°° vector fields along c, with the following properties: 


(a) 


D{V + W) DV DW 


dt 


dt 


dt 


(b) 


D(fV) 

dt 




f ^ + / 7T for c< 


: [a ， 6] — > M. 


(c) If V s = ⑴ for some C°° vector field Y defined in a neighborhood 

of c(t), then 

DV 

dt 


dt 


PROOF, If x is a coordinate system around p = c(to) then for t sufficiently 
close to to we can write 





c{t) 


for unique functions v J \ If (a), (b), (c) are to hold, then we must have 
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and thus 


DV 

dt 


i: ( 竽 +i:c_ 

k=\ ^ ij = \ 


dd_ 

dt 


v J (0 


) 


dx k 


c{t) 


So there is at most one such operation. Conversely, it is easy to check that this 
formula does have the required properties. 


Remark.. There are two things which should be noted about this Proposition, 
and the interest of the first tends to overshadow the significance of the second. 

(1) The Proposition assigns a value to DV/dt even at points where dc/dt — 0. 
This value is not necessarily 0. In fact, if c is a constant curve, c(t) = p for 
all /, then a vector field V along c is just a curve in M p , and DV/dt is just the 
ordinary derivative of this vector-valued curve. 

(2) When dc/dt ^ 0, so that c is an imbedding in a neighborhood of t, we can 
always write V s = Y c { s ) for some Y defined in a neighborhood of c(t). But Y is 
not unique, so even now condition (c) does not by itself determine DV/dt — we 
also need conditions (a) and (b). This result is very similar to our basic principle 
for defining tensors (Theorem 1.4-2): in the proof of that result we expressed 
all vectors in terms of the d/dx l \ in the present case we express all vector fields 
along c in terms of the d/dx^c^). 

We say that a vector field V along c is parallel along c (with respect to V) if 
DV/dt = 0 along c. When Af = M 71 and V is just the directional derivative, 
we obtain the standard picture of a parallel vector field. In general, given a 


4 


curv T e c: [a, 6] Af, and a vector V a e there is a unique vector field V 

along c which is parallel along c. This is because the equations 

(*) = o 
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are linear differential equations with unique solutions v J \ defined on all of [a, b], 
for given initial conditions; the desired vector field V is then 

v t=fy 、 士 • 

7=1 6x c(t) 

The vector V t e M c ( t ) is said to be obtained from V a by parallel translation 
along c. It is clear from equations (*) that (V-\-W) t = V t -\-W t and (A-F) r =X V t 
for A. E K. We therefore obtain a linear transformation 

T t - ^c{a) ^c{t) Va ^ Vt- 

Clearly, r t is one-one, for its inverse is just parallel translation along the reversed 
portion of c from t to a. Thus, along any curve c we obtain an isomorphism 
between any two tangent spaces M c ( tl ) and M c ⑹； this possibility of compar¬ 
ing, or “connecting”，tangent spaces at different points gives rise to the term 
“connection”. It was invented by Levi-Civita，who used the equations (*) as the 
definition. 

The parallel translation r t is defined in terms of V, but we can also reverse 
the process. 

3. PROPOSITION. Let c be a curve with c(0) == p and c^O) = X p . Then 

^Xp Y ~ lini T{ r h^ 1 ^c(h) — ^pY 
h^o n 

PROOF. Let Vi ， • • • ， T4 be parallel vector fields along c which are linearly in¬ 
dependent at c(0), and hence at all points of c. Set 

n 

， ( 冲 )） =xy ⑴ • K(r). 

/ =i 

Then 

1 i r w ■ 

I™ - (r h ~ l Y c(h) - Y p ) = lim - y^y 1 {h)r h ~ x Vi{h) - }^0)V7(0) 

h^O h h^O h ^ 

L /=l J 

j r n ~ 

=lim - T ⑼ - }^0)V7(0) 

h^O n 

L /=l J 

^ ’ h — >0 fj 

/ = 1 

i =\ ? 一 0 / =1 

= ^x p ❖ 
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Motivated by Proposition 3, we now define Vj^^4 where A is any tensor field 
of type (^). We have 

Z ⑷ G T^(Mq) for all q, 

and the isomorphism 

T t ： 从 ■⑼— ^c{t) 

gives rise to an isomorphism 

炉( 鸠⑼） — ⑴)； 

therefore we can define 

Vjr p A = lim UiT^rh^Aicm- A(p)). 

«—>0 n 


If we regard A(q) as a function of k tangent vectors in M q and / vectors in M q *, 
this means that for V\,.. e M p and 入 i, • • • ，入 / e M p * we have 

• •. ， 以，入 1， … ，入 /) 

= lim |[^(c(/z))(r/ r i; 1 ,..., r h v k , r h *k u ..., r h *k k ) 
h^o h L 

* 


4. PROPOSITION. This Vj^ p ^4 coincides with that given by Proposition 1. 

PROOF. It suffices to prove properties (1)-(5) for the new Vx p A. This is left to 
the reader [the proof of (4) and (5) involves the usual trick which one uses in the 
proof of the product rule for derivatives] ♦ 


[Note that without this result it would not be at all obvious that Vj^ p ^4 is linear 
in A>.] 


5. COROLLARY. Let ^4 be a tensor field of type (f). If Fi,..., are vector 
fields，then 


k 

{Vx p A){Y \ p ,..., Yk p ) = Vx p {A{Y \,..., y)t)) _ ^2 /(&/> ， ••• ， ▽A r P h ， •… ， Ykp). 

1 = 1 

PROOF. Use Proposition 1， noting that A{Y\, • • • ， Y^) can be obtained by ap¬ 
plying k contractions to 

A^Y X ^ -^Y k . 


(It is also instructive to obtain a proof from the definition in terms of parallel 
translations.) ♦ 
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Our definition of VxA may be compared to the definition, in Problem 1.5-14, 
of Lx A. In the latter case, the maps 0% given by the vector field X play 
the roles of the parallel translations r/, in the present instance. Notice that 
Problem 1.5-15 can be used in both cases to formally extend the operation from 
vector fields to arbitrary tensors. 

After these preliminaries, the further study of Koszul connections proceeds 
quite rapidly. We first define, for vector fields X and F, 

T(X,Y) = V x Y -VyX -[X, F]. 

A simple calculation shows that T is linear over the C°° Junctions^ so that it de¬ 
termines a tensor. Clearly this is just the classical torsion tensor; as usual, the 
invariant definition involves a bracket, which disappears in the expression in 
coordinates. 

We now want to distinguish the connection determined by the Christoffel 
symbols for a metric. Suppose we are in a Riemannian manifold (M, 〈 ，》）• 
We will call a connection compatible with ( , ) if the parallel translations 
x t : M c {a) -> M c (j) along any curve c : [a,b] M are isometries (with respect 
to 〈， )c{a) and 〈 ，) c (/)). 


6 . LEMMA. A connection V is compatible with a metric ( , ) if and only if it 
satisfies the following condition: If V and W are vector fields along any curve c, 


then 


PROOF. 

have 


d T IDV 

，，叫 I 


,W) + [V, 


DW 

dt 


Suppose V satisfies this condition. Then if V is parallel along c we 


d 

Jt^ V) 



so (V, V) is constant along c. Thus each x t is norm preserving, and hence an 
isometry. 

Conversely, suppose V is compatible with the metric. Choose parallel vector 
fields P\,.... P n along c which are orthonormal at one point of c, and hence 
at every point of c. Let 


n n 

/ =1 

Then 

n 

(v. w) 
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and by Proposition 2 we have，remembering that DPi/dt — 0, 


So 


DV 

dt 


dt 


DW _ A dwJ’ 

dt dt 7 

y=i 





7. COROLLARY. The connection V is compatible with 〈 ， 〉 if and only if 

X P {Y,Z) = {V Xp Y,Z p )^ (Y p ^ Xp Z) 
for all vector fields Y, Z and vectors X p € M p . 

PROOF. Apply the Lemma to a curve c with c^O) =： X p . 


8 . LEMMA (FUNDAMENTAL LEMMA OF RIEMANNIAN GEOME¬ 
TRY). On a Riemannian manifold (M, { , )) there is a unique symmetric 
connection compatible with 〈 ，〉• 


PROOF. Suppose V is compatible with 〈 ，〉• Choose a coordinate system 
(x, U). By Corollary 7 

{ 入 hjk _ 3 / 3 3 \ 1^ a a \ / a a \ 

Cyclically permuting i, j\k, and using 


we obtain 



(from symmetry), 


n 


jy! 舞 十 ― 


dx l 


dxj ’ dx k 


[(/， 众]， 


which implies that 

Thus the r’s for ▽ must be the Christoffel symbols. 

We know that the Christoffel symbols do indeed satisfy (*), which shows that 
the equation in Corollary 7 does hold. (In fact, this equation is equivalent to 
Ricci’s Lemma, 5-3.) ♦:. 
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The unique connection of Lemma 8 obviously ought to be called the Christof- 
fel connection for < ，〉； instead，it is called the Levi-Civita connection* for 
< ，》！ Naturally, Lemma 8 is more impressive before reading the proof than 
after. Nevertheless, it is still a very nice result. Perhaps its only defect is the 
restriction to symmetric connections; in Addendum 1 to this chapter we present 
some justification for this restriction. We have already given one interpretation 
of symmetry; but the following will be more useful for present purposes. For 
a C°° function s: M 2 M (a ‘‘parameterized surface’’ in M)，we define a 
vector field V along s to be a function V with 


In particular, we have the vector fields 



For any C°° vector field V along s, we define 


V J(x,y) 


covariant derivative along c{t) = s(t, y) 
of r V(t,y), evaluated at ， = x ， 



and we define DV/dy similarly. 


8 . PROPOSITION. If V is symmetric, then 

D ds _ D ds 
3.v dy dy dx 

PROOF. Express both sides in terms of a coordinate system and compute♦ 

* This is an historical mix-up, due to the fact that the collection of parallel translations 
determined by V was called “Le' i-Civita’s coniiection” for V. 
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We are now ready to define the curvature tensor for a Koszul connection V. 
If X, Y, Z are vector fields，we consider the vector field 

R(x, Y)Z = Vjr(VrZ)- Vr(VjrZ)- V [jr , r] Z. 

A straightforward computation shows that (because of the bracket term) R is 
linear over the C°° functions in all three variables，so it defines a tensor i?，the 
curvature tensor of V. Obviously, this definition is somehow related to the 
Ricci identity (Proposition 5-8) 


n 


n 


(*) 


^ l 'Jk ~ ^;kj 


Rl[ j k + Y^ l .J T ;k 


but at first sight we seem to be missing a term for the torsion. To see why this 
is so, recall that if 


n 


z =[ 入 ’ 


/ =1 




then X l j are the components of VZ ， i.e.. 


n 


▽ 丄 z 入 ，; 

dx^ i=\ 


J 


dx r 


Now l!'jk = ^ l ； j；k are not the components of V^ x k (V a/a；c； Z); rather they are 
the components of V(VZ). So 


n 




jfc 


3/ 


vvz 


dx^ ? dx k 


▽ 丄 (vz) 

l dx k 

V P fvz 


d.x 


k 


dxJ 


)) - 


VZ V 


3.v 


k 


dxJ 


by Corollary 5 


V , V , Z - V 


dx k v d.x J 


V 


z. 




dx J 
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Consequently, 


▽3 ( v a z ) - v a ( v a z 

dx J ^ dx k / dx k ^ dx J y 




d^ 


n , n 

EE 

/ =i v /=i 


3x 7 


)J R[ nk — 


fy ， 说 


▽/« 


(t-i) 


by (4 


Since 


V/ « 


^ T kjl) 

/ =i 3x , / 


z = J 2 T l^ z =Z T l^ 


ij=\ 


dx l 


we obtain simply 


7 丄(▽丄 Z )_ ▽丄（▽丄 Z 

dx j dx k dx k ^ dx j ， 


n , n 

E(E 

/ =i v /^i 


入％ 


(d d \ 

= R {d^J^) z, 

so the definition does agree with the classical one. At the same time, it is clearly 
preferable, in that it does not involve the torsion. 

[Classically, there is practically no way to even name the quantity 

▽丄(▽丄 Z )， 

dx j V 3.v^ ' 

since k^jcj is automatically interpreted as the components of the tensor VVZ. 
For a vector field Y = b l d/dx\ we can write the components of 


▽ 3 (VyZ) as 


/ 1 


the summation over / making it clear that we are taking covariant derivatives of 
the tensor field with components \x l = 乂 1 \lb l . However, for the special case 


▽a Z ， 


the expression 

is about the best we can do.] 


3.v 7 ^ 3a j 


(P A ) 
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The following result will be our analogue of the Ricci identities. 

9. PROPOSITION. Let s: 1R 2 — M be a parameterized surface, and V a C°° 
vector field along s. Then 

D D D D (ds ds\ lr 

dx dy dy dx \ 5 dy J 

PROOF. Compute in a coordinate system. ♦ 

It should come as no surprise to learn that we can now prove the Test Case; 
it may be surprising, however, to see how simple the proof becomes. 

10. THEOREM (THE TEST CASE ； THIRD VERSION). Let (M, ( , }) 
be an / 卜 dimensional Riemannian manifold for which the curvature tensor R 
(for the Levi-Civita connection) is 0. Then M is locally isometric to with its 
usual Riemannian metric. 

PROOF. We assume we are in with y 1 ,... the standard coordinate 
system. 

Step 1. We claim that we can find vector fields X, with arbitrary values A"(0) € 
R n o, satisfying 

V 3 A" = 0 for all /, 

a? 

and hence 

▽ z A" = 0 for all Z • 

To do this we first choose X(y, 0, • • • ， 0)，with the desired initial value，so that 
it is parallel along the > ?1 -axis. 




242 


Chapter 6 


For each fixed y l , we then choose X(y l , y\ 0, … ， 0)，with the values X(y l ,0, 
… ， 0) just obtained, so that X is parallel along the curves y i-> ( 少 1 ， 少， 0, ... ， 0). 
The vector field X is now defined on the surface 


以少 1 ，少 2 )= (少 \ 少 2 ,0，...，0). 


Clearly DX/dy 2 is 0 along s, while DX/dy l is 0 along 0)}. Now we have 

dy l dy 2 dy 2 dy l ySj； 19 dy 2 

so 

D D 

-—-—= 0 . 

oy 2 dy x 

This means that DX/dy x is parallel along the curves y j；). Since 

DX/dy x is 0 at s ( 少 ^O)，we have DX/dy x = 0 along 
We can clearly continue in this way to obtain the desired X. Now choose 
X\,..., X n with this property so that ^(0),..., A^(0) are orthonormal with 
respect to ( ， 〉 o. Clearly, X\,. .. ,X n are linearly independent in a neighbor¬ 
hood of 0. 

Step 2. Since the connection V associated with ( , ) is symmetric, we have 

0 = ▽々不 一[不, A}]. 

But Vz^i = 0 for all Z. So [Xi, Xj] = 0. This means that there is a coordinate 
system a 1 , ..., x n with Xi = d/d.x 1 . 

Step 3. We claim that .y is the desired coordinate system, i.e., that the Xi are 
everywhere orthonormal. This is obvious, for they are orthonormal at 0 and 
parallel along any curve, and parallel translation preserves the inner product 
(,)• ❖ 
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This proof is completely analogous to the first two proofs of the Test Case ， 
except that it is “dual” to them in the sense that we find the vector fields d/dx l 
instead of the 1 -forms dx l . 

Step 1 uses the conditions R = 0 to obtain the parallel vector fields Xi, in 
a way completely analogous to, but simpler than ， Step 1 in the second 
version. 

Step 2 now uses symmetry of the connection to prove that [A/, Ay] =： 0, a 
condition which is dual to exactness of the 1 -forms obtained in the 
previous versions. 

Step 3 uses the definition of the Levi-Civita connection to prove that the vec¬ 
tors Xi are everywhere orthonormal. As we have already pointed out, 
the fact that parallel translation preserves the inner product 〈， } is 
equivalent to Ricci’s Lemma，which we used in Step 3 of the second 
version. 

Notice that the proof shows that if we can find n everywhere linearly indepen¬ 
dent vector fields Xi，• •. ， X n , which are parallel (i.e.，which satisfy VzA/ = 0 
for all Z)，then the manifold is flat. Consequently，such vector fields generally 
cannot be found. This means that parallel translation of a vector along two dif¬ 
ferent curves with the same end-points generally gives different results; for if X p 
were the parallel translate of Ao along both the curves shown below, we would 
clearly have both 


V 3 ^(p) = 0 and V 3 ^(p) — 0 . 

a? a? 


This phenomenon can also be described by saying that parallel translation of 
a vector along a closed curve generally brings it back to a different vector. In 
Volume III we will see some more quantitative statements of this fact. 

Our next two results are simply Proposition 5-9 and 4-10. They are reproved 
here in a spirit more in keeping with the present treatment of connections, but 
it is not hard to see that they are essentially the proofs given before. As a matter 
of notation, if we are given an expression A(X, F ， Z)，we will let SA(X 7 Y y Z) 
denote the cyclic sum 

5A(X 7 r,Z) = A(X 7 Y,Z) + A(V 7 Z,X)-h A(Z,X, Y). 
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11. PROPOSITION. Let V be a connection with torsion T and curvature R. 
Then for all vector fields X, Y, Z, W we have 

(1) R(X, Y)Z = -R(Y,X)Z 

(2) (Bianchi’s first identity) 

Y)Z} = 3{(V X T)(Y,Z)} + 5{T(T(X, Y)Z)} 

(3) (Bianchi’s second identity) 

+ Z{R(T(X,Y\ Z)W} = 0. 

In particular, if T = 0, then 

(20 Z{R(X, Y)Z} = 0 
(30 3{(W Z R)(X,Y,W)} = 0. 

PROOF. (1) is clear from the definition. 

To prove (2)，we first note that 

T{T{X,Y\Z) = T{V X Y,Z)~ T(V y X ， Z) - 7([X,F],Z) 

= T(W X Y,Z) + T(Z, V y X)- T([X, Y],Z). 

We also have, by Corollary 5, 

(v z r)(x, n = v z (r(x, Y))~ r(v z x, Y) - T{x, v z y). 

From these equations we obtain 

Z{T(T(X,Y\Z)} = -3{(v z r)(x, r)} + S{v z (r(x ， n) — T([x, r], z)}. 

The second term on the right side equals 

S{Vjr(V y Z)- Vy(VjrZ)- V [jr , y] Z} + S{[[X, Y],Z]} = Z{R(X, F)Z} + 0, 

since the Jacobi identity states that 3{[[X, F], Z]} =0. 

To prove (3)，we first use (1) to obtain 

Z{R(T{X, Y),Z)W} 

=Z{R(V X Y,Z)W + R(Z,V Y X)W - R([X, Y],Z)W} 

=Z{R(V z X, Y)W + R(X ， V Z Y)W}~ Z{R{[X, F], Z)W}. 
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By Corollary 5 we also have 

(S/ Z R)(X,Y,W) = S/ Z (R(X, Y)W)- R(X, V)V Z W 

- R{V Z X, Y)W - R(X, V Z V)W. 

From these equations we have 

S{R(T(X,V), Z)W} = ~S{(V z R)(X 9 r,W)} 

+ 3{V Z (R(X, Y)W)~ R(X, Y)V Z W 
-R([X,Y],Z)W}. 

Now 

▽ z ( 及 (X ， Y)W)- R(X, Y)V Z W - R([X,Yl Z)W 

— V^Vj^Vy VT — Vj^VT — 

- Vj^VyV^W + VyVj^V^W 7 + (V[£^y] 
r ] V[jr,r] W 7 

Writing [V^ ， Vy] for the operation W Vj^ Vy W — Vy Vj^ W (as on pg. 1.155), 
we can write this as 

▽/([▽jit, ^y]W) - [Vjit, Vy](V z W) + ▽[[m ， z]W 7 . 

The cyclic sum of these quantities is 0, because of the Jacobi identity 

su [ 义 , n,z]} = o, 

and the “Jacobi identity” 

3{[Vz, [Vjir, Vy]]iy} = 0 

(recall that in any ring, if we define [a, b\ 二 ab — ba ， then [ , ] satisfies the 
Jacobi identity). ♦ 

12. PROPOSITION. For the curvature tensor R of the Levi-Civita connec¬ 
tion V associated to a Riemannian metric ( , } we also have 

(1) {R(X i Y)Z,W} = ~{R(X,Y)W,Z) 

(2) {R(X, V)Z,W) = (R(Z, W)X,V). 
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PROOF. Equation (1) is equivalent to 

{R(X, Y)Z, Z)=0 for all X, Y, Z. 

It suffices to prove this when [X, Y] = 0. In this case 

(R(X, Y)Z,Z) = (Vjr(V r Z)- V Y (V X ZIZ), 

so we must show that (Vj^(Vj^Z), Z) is symmetric in X and Y. 
Now YX (Z, Z) is symmetric in X and F, since [X, Y] = 0. But 

X{Z, Z) = 2{VxZ, Z), 


so 

YX(Z,Z) = 2(V r VjrZ,Z> + 2(V^Z, V r Z>. 

Since the right-most term is symmetric in X and F, so is Z). 

Equation (2) follows from (1), Proposition 12, and Proposition 4-11♦ 


To complete our treatment of Koszul connections, we define a geodesic for V 
to be a path y : [a, b] ^ M with 


dt dt 


thus, the tangent vector dy /dt must be parallel along y. In the coordinate 
system x we immediately obtain the equations for a geodesic, 


2..k 


d z y 

~dt^ 


n 


E r 5_ 

i，j = ' 


dy 1 dy 


dt dt 


0 . 


The existence of a unique geodesic, with given initial vector y’(0) € M p follows 
immediately. Since parallel translation is an isomorphism, the tangent vector 
dy /dt of a geodesic y is nowhere zero (except when y is a constant path). If V 
is any vector field along y, then we can write 

V 年 f: 

dt 

and 

DV df dy D dy df dy 

dt dt dt dt dt dt dt 
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This is 0 only if / is linear. Consequently, a reparameterization y = y o p oi y 
is also a geodesic only if p is linear.* 

We know that this result can be made more precise for the Levi-Civita con¬ 
nection V associated to a metric ( , }, and the proof of the more precise result 
is now especially easy: We have 

^y_\ = 2 lR^L ^Z\ = 0 

dt \ dt ' dt I \dt dt ’ dt 1 ’ 

so \\dy/dt\\ is constant, i.e., y is parameterized proportionally to arclength 
(Theorem 1.9-12). The reader is invited to investigate how the proof of Gauss’ 
Lemma (1.9-15) is simplified in the present set up. We will complete our study 
of the V operator by providing the invariant description of the First Variation 
Formula promised so long ago. 


13. THEOREM (FIRST VARIATION FORMULA). Let y: [a,b] ^ M be 
a piecewise C°° path and a : (—e, e) x [a,b] ^ M a. variation. Let 


W t 


Vt 


da 
du 
dy 
~dt' 


(0,0, 


the “variation vector field” 
the ‘‘velocity vector of y” 


A t = — V t ^ the “acceleration vector of y”. 

at 

Also choose a = to <•—< t]\f = b to include all discontinuity points of V, and 
set 


Ar,. V - V{ti^) - V(tr) i=\， …， N — \ 

A t0 V = V(t 0 ^) 

^t N V = -V{tN~)- 

Then 

dE(a(u)) =_ f\ WtMl}dr ^J^ {WtnAtjVl 

PROOF. We will give the proof when V has no discontinuities, leaving to the 
reader the simple auxiliary argument for the general case. 


* Again we point out that certain global theorems about geodesics for Riemannian 
metrics do not generalize to arbitrary connections; see Chapter 8, Addendum 2. 



248 


Chapter 6 


From 


3 / da da 


D da da 
du dt 9 dt 


we obtain 


dE(a(u)) 

du 


d f b I da da 


2 du 


dt ? dt 


f b ID da da 
l a \dt du’ dt 


>b I D da da 
,\du dt ' dt 


by Proposition 9. 


Now the identity 


d Ida da 
dt \ du' dt 


D da da 
dt du' dt 


da D da 
du 9 3^ dt 


implies the following analogue of integration by parts: 


f b lDda da 
f a {dt dt ’ dt 


da 3of \ 
du’ dt j t 


t=b 


>b Ida D da 
,\du' dt dt 


Thus 


dE{a{u)) 


—/ {W t ,A t )dt- (W(al V{a)) + {W{b\ V{b)), 


which is the desired formula. 
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ADDENDUM 1 

CONNECTIONS WITH THE SAME GEODESICS 

Let V and ▽ be two connections on M, We define 

D(X i Y) = V x Y -V X Y, 

A simple computation shows that D is linear over the C°° functions in both argu- 
ments，so it determines a tensor /), the difference tensor of the two connections. 
In a coordinate system x we have 

As is well known, there is a unique way to write D = S -\- A with S symmetric 
and A alternating, namely 

S(X,Y) = \[D(XJ) + D(Y,X)\ 

A(X, Y) = \[D{X,Y)- D{Y,X)}. 

Note that if T and T are the torsion tensors of V and V, then 

(*) 2A(X, Y) = V X Y -V X Y - V Y X + V Y X 

=T(X, Y) + [^r, Y] - T(X, Y) - [^r, Y] 

= T(X, Y) - T(X, Y\ 

so V and ▽ have the same torsion if and only if ^4 = 0. 

14. PROPOSITION. The following are equivalent: 

(a) The connections V and ▽ have the same geodesics [with the same pa- 
rameterizations] 

(b) D{X, ^)=0 for all X 

(c) S = 0. 

PROOF, (a) (b): Given 0 ^ X p e let y be the geodesic (for the connec¬ 
tions V and V) with = X p . Let ^ be a vector field in a neighborhood 

of p which equals dy/dt along the part of y in the neighborhood. Then 

D(Xp,X p ) = Vx p X - NxpX = 0 + 0. 
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(b) 4 (a): Let ^ be a geodesic for V, and let A" be a vector field which equals 
dyjdt along a portion of y. Then 

^x p X = D(X P ,X P ) + V^X = 0 + 0, 

which shows that ^ is a geodesic for V. 

(b) (c): D(X, X) = 0 S(A", X) = 0. The latter condition implies that 

S = 0, for we have 

0 = s(x -^y,x -hY) = s(x,x)-h s(r,r) + s(x, r) + s(y,x) 

= 2S(X,Y). ❖ 

15. COROLLARY. If the connections V and V have the same geodesics and 
the same torsion, then V = V. 

16. COROLLARY. For every connection V, there is a unique connection V 
with the same geodesics and with torsion 0. 

PROOF. Uniqueness follows from Corollary 16. For existence we define 
=VxY — ^T(X, F), checking easily that V is a connection. Since T is skew- 
symmetric, D = must simply be 4， so S = 0, so ▽ and V have the same 
geodesics. Also, T = T — 2^4 = 0, so V has torsion 

We thus see that if we divide all connections into equivalence classes, putting 
all connections with the same geodesics in the same class, then each class has 
exactly one connection with zero torsion. We can also say exactly how large 
each class is: If ▽ is a connection with torsion 0 and 7^ is a skew-symmetric 
tensor of type (^), then there is a connection V in the same class as V, but with 

torsion 7\ namelv 

✓ 

V x Y = V x Y^ l -T(X,Y). 

It is also of interest to inquire when two connections V and ▽ have the same 
geodesics, with possibly different parameterizations. Such connections are called 
“projectively equivalent”，because in ordinary space a projective map is one 
that takes straight lines to straight lines. 
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17. PROPOSITION (H. WEYL ； 1921). The following are equivalent: 

(a) The connections V and V have the same geodesics，with possibly differ¬ 
ent parameterizations. 

(b) For every X there is kx with D(X^ X) = kx ' X. 

(c) There is a (unique) 1-form a> with S(X, Y) = a>{X)Y + a>{Y)X. 

•⑻ 4 (b): Given X p g M, let y be the geodesic for the connection V 
with y f = X p ^ and let y be the reparameterization which makes it a geodesic 

for V. Let X be a vector field which equals dy jdt along y and X a vector field 
which equals dy/dt along y. Then along y we have X = fX for some /. So 

D(X p ,X p ) = V Xp X^V Xp X 

^Ap)[W)^ P ^f(pWx p ^] 

=f{p)X p {f) ^ X p = X p (f)X p . 


(b) (a): Let y be a geodesic for V, and let X be a vector field which equals 

dy /dt along y. Then for p = y(t) we have 

⑴ = D{X p ,X p ) + Vx p X = 'kxp^p = g(t)X p , say. 

Let f(t) = e G ^ ^ 0, where G’(t) = g(t) so that 

df 


( 2 ) 


dt 




and let X be a vector field such that 


(3) 

- 1 
x yU) = Y(t) Xv{t) 

1 dy 
~ f{t) dt * 

Then 





^^ = J^ Xp l X 



1 

~ no 

Xp {j) Xp+ 



i 

= W) 

" df/dt 

-W) g{t)Xp 


fit) 


by (1) 


fM] by ⑵ 
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Consequently, ^ is a geodesic for V if we reparameterize it as ^ y 。 p~ x , 
where p is chosen so that 

dy ^ 

= x vit). 


dt 


i.e., so that 


(P~ x Y{t) 


dy 

dt 


p 


1 dy 

i(/> ~ fiP~\t))~di 


by ⑶‘ 


p~HO 


For this we need 

1 _ 1 

p f {p~\t)) = /(rUO )， 

or simply p\s ) : = f{s) > 0 (compare Problem 1.9-27). 
(c) => (b) is clear. 

(b) => (c): We first establish the following algebraic 


LEMMA. Let F be a vector space, and S : V x V ^ V 2 i symmetric bilinear 
map such that for each v e V there is k v e M with 

S(v, v) = X v v. 

Then there is a unique (p e V* such that 

S(u, w) = (p(v)w + 

PROOF. If 0 exists, clearly <p{v) = k v /2 for i; # 0. Conversely, if this definition 
makes 0 linear we will be done, for then 

S(v, i;) + S(w, w) + 2S(u, w) = S(v -h w,v w) — A v+U ； (i; + w) 

=( 入 v + 入 u；)(u + 切)， 


2<p(v)v + 2<p{w)w2S(v, w) = 2<p(v)v + 2<p(w)w + 2<p(v)w + 2(p(w)v, 
which yields 

S(v, w) + (p(v)w + (p(w)v. 

\Ve prove that 0 is linear as follows. From 


入 v+u ； (i’ + w) = + k w w + 2S (v, w) 

X v —id (u — w) = XyV -j- Xujiv _ 2S (u, w) 
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we obtain 


(Av+iy + ^" V—w — 2 入 + (入 r + y ； 一 入 u — id — 2 入出)1/^ = 0. 


For linearly independent v and w we thus have 


入 v+w 
入 D + W 


+ 入 l>— — 2 A，u = 0 
—入 d — u ; — 2 人出二0, 


and hence A v+iy = k v + k w . This is clearly also true if v and w are linearly 
dependent. Homogeneity is likewise trivial. Q.E.D. 

When V is «-dimensional，we have the explicit formula 

(*) <p(v) = —-— trace of u; i-^ *S(u, w) y 

n + 1 

which can be deduced as follows. Choose a basis Vi” ..，v n with v = V\. Then 

S(u l5 uy) = 0(ui)uy +00/)01 ， 


so 

n 

trace of u; , w) = 7 th component of <p(v\)vj + </>(Vj)v\； 

this component is (p(v\) for 7 ^ 1 7 and 2(p(v\) for j = \ . 

Formula (*) clearly shows that 

a)(X) = trace Y S(X, Y) 
n + 1 

is the desired C°° 1 -form on M. 


18. COROLLARY. The connections V and V have the same torsion and 
the same geodesics (suitably reparameterized) if and only if there is a (unique) 
1 -form a) with 

D(X.Y) = (o(X)Y + (o{Y)X. 

Note, by the way, that for any connection V and any 1 -form a), the function 

V X Y = V X Y -h(o(X)Y -h(o(Y)X 
always is a connection，with the same torsion as V. 
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ADDENDUM 2 

RIEMANN’S INVARIANT DEFINITION 
OF THE CURVATURE TENSOR 

In part C of Chapter 4 we presented an extract from Riemann’s paper of 
1861 . Our use of “ … near the end was rather deceitful, because one of 
the omissions was by no means a minor one. In the deleted portion Riemann 
gives a result, which，although it plays no further role in our development of 
Riemannian geometry, is nevertheless extremely interesting, for it amounts to 
another invariant definition of the curv ature tensor We therefore give below 
an unabridged version of the second part of the extract, beginning with the last 
paragraph in the version in Chapter 4C. It will present greater difficulties of 
interpretation than any thing else we have read，and is followed immediately by 
an exposition in modern terms. 

The functions g\j must necessarily satisfy these equations whenever 

JlQijdy 1 can be transformed into the form ^{dx 1 ) 2 : we denote the left 
Uj . i 

side of this equation by 

To make the nature of this equation more transparent, we form the 
expression 

33 ^ ^ q jj dyi dyj — E gij dy l 8y J + dd^gijdy 1 8y\ 
the variations of the second order d 2 , dd ， 8 2 being so determined that 

^ ^9ijdy l 8y J - d^gijdy 1 8 r y j - d^gij 8y l 8 r y J = 0 
S’ Y^gijdy 1 dy ] - 2d Y^Qijdy 1 / ^0 

8 r denoting an arbitrary variation. In this way the above expression becomes 
(II) = ^ (ij ， kl)(dy l 8yj — dyJ 8y l )(dy k — dy l 8y k ). 

Now from the formation of this expression it is immediately evident that 
a change of the independent variables changes it into a new form depending in 
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the same way on the dx l dxK And if the quantities g r j are constant，all 

coefficients of the expression (II) turn out to be equal to 0. Thus if 9ij dy l dy] 
can be transformed into a similar expression with constant coefficients, it is 
necessary that expression (II) vanishes identically. 

In the same way it turns out that if the expression (II) does not vanish，the 
expression 

t Y] (ij y kl)(dy l dy j - dy J 8y l )(dy k 8y l - dy l 8y k ) 

(HI) -I ——— : --- —2 

J^Qijdy 1 dy J ^Oijdy 1 V - (J2 gi J dyl 8yJ ) 

does not change if the independent variables are changed, and moreover remains 
unchanged if in place of the variations dy l ， 8y l y arbitrary independent linear 
expressions of them, ady 1 + /38y l y ydy 1 + 8y l are substituted. Moreover, the 
maximum and minimum values of the function (III) of the same dy 1 ， 8y l depend 
neither on the form of the expression ^gijdy 1 dy J nor on the values of the 

variations dy 1 ， 8y l y whence from these values it can be determined when two 
expressions of this kind can be transformed into each other. 

These interpretations can be illustrated by what one might call a geometrical 
interpretation ， which, although it depends on unusual conceptions, it will 
nevertheless help，as one goes along, to have touched upon. 

The expression V^gij dy l dy^ can be regarded as just the line element in a 
generalized space of n dimensions transcending our intuition. If in this space at 
the point (y 1 ,... ,y n ) all shortest lines are drawn，in which the initial elements 
of variation of the y l are a dy 1 + /3 dy 1 : a dy 2 + (3Sy 2 : :a dy n + /3 8y 7 \ a 

and P denoting arbitrary quantities, these lines make up a surface which can be 
developed in the space of our common intuition. In this way the expression (III) 
will measure the curvature of this surface at the point (y 1 ,... ,y n ). 

If we now return to the case n 二 3, the expression (II) is a form of the second 
degree in 

dy^ 8y^ — dy^ 8y 2 y dy^ dy 1 — dy 1 dy 1 — dy 2 Sy 1 ， 

and in this case we obtain six equations, which the functions g；j are required to 
satisfy if gij dy } dy) can be transformed into a form with constant coefficients. 
Given an acquaintance with the traditional methods, it is demonstrated without 
difficulty that these six conditions, when they are satisfied, suffice. It is to be 
observed nevertheless that only three of them are independent. 
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Riemann’s description of the curvature tensor is particularly difficult to deci¬ 
pher because he is using classical notation from the calculus of variations. To 
state it in modern terms, consider a function s: M 2 ^ A/, with ^(0) = p e M. 
This function s can be thought of as a “2-parameter variation of the point 
and on M it gives rise to 2 vector fields, 

3 ^ ( 3 



3^ 

3 少 




(訂 


This notation involves the usual ambiguities, and in conformity with this, the 
function 

I ds 3^ 

(X ， 少） 卜 （ (x, ^), — (x, 


on R 2 will be denoted simply by 


dx 


3 ^ 3 ^ 


dx 


dx' dx 

Riemann directs us to consider the expression 

/ . 3 2 I ds 3^ \ 3 2 / 3^ 3^ 

㈧ ' 


) 


3 I ds ds 


dx 2 \ dy’ dy I dxdy \3x ， dy j dy 2 \ 3x ’ 3x 


If we let 


X 


ds 



3 夕 


then the value of this expression at (x ， 少 ) =(0,0) can be written 

X p (X((Y,Y)))~ 2X p {Y{{X,Y)))^ Y p (Y({X, X»), 

or simply 

(B) [XX{Y,Y)~ 2XY(XJ) + YY{X, X)](p). 


On the other hand, to be certain that the expression (B) will equal (A) for some 
I 2 — M ， we need to know that [X, Y] = 0. Riemann does not consider 
all s: M 2 M, but only those with the following property: If cr: M 3 ^ M is 
cr(.v, v,0) = s{x,y), then 

3 Ida 3cr\ I d da 3cr\ 


3.x dy ? dz / 


3 Ida 3cr\ 3 Ida 3cr\ 
dz \ dx ' 3.v / ^dx \ dx dz /。 


d Ido da 
dz \ 3 v ? 3 v 



> at (0,0,0). 
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Riemann claims that if ^: M 2 A/ has this property, then the value of (A) at 
( 义，少） =(0,0) is —2{R(X p , Yp)Y p , X p ). (To account for the factor of —2 recall 
that Riemann’s (" ， kl) equals Rfjik/2 = —RijklH and note that Riemann has 

(dy f A dy^) 0 {dy k A dy l ) instead of dy l 0 0 dy k 0 dy’ •) This assertion 

can be rephrased as follows. 

19. THEOREM (RIEMANN). For any linearly independent vectors X p ,Y p e 
M p there are vector fields X, Y extending them such that 

(a) [ 尤 ， n 二 0 

(b) For every vector field Z with [X, Z](p) = [F, Z](p) = 0 we have 

(1) Z{X,Y)-Y(X,Z)-X{Y,Z)=0^ 

(2) Z{X,X) -2X(X,Z) =0 ^ at p. 

(3) z(r,r>-2r(r,z> = o , 

For any such vector fields X and Y we have 

—2(R(X P ， Y P )Y P ,X P ) = [XX(Y,Y)- 2XY{X,Y) + YY{X, X)](p). 
PROOF. Using Corollary 7, we see that equation (2) is equivalent to 

2{V Z X,X) - 2{V x X,Z) - 2(X,V x Z) = 0 at p 

and hence to 

(V x X,Z) = 0 at /?， 

since ^z^(p) ~ Vj^Z(/?) = [X, Z](p) = 0. Thus equation (2) is equivalent to 
(20 V x X(p) = 0. 

Similarly, (3) is equivalent to 

(30 V Y Y(p) = 0. 

Finally, (1) is equivalent to the equality, at /?, 

+ v z r>- v y z>- ( v x y , z )-( y , v x z ) = o : 


and hence to 


(Vj^F + Vy Z) = 0 at p. 
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This is equivalent to 

0 = Vj^FO) + VyX(p) = 2VxY{p), since - VyX = [X, Y] = 0, 

so (1) is equivalent to 

(1 ’） ^xY(p) = ^Y^(p) = o. 

We can obtain such vector fields X and F，with given values X p , Y p g M p , by 
mapping M 2 into M in such a way that the x and y axes go into two geodesics 
in M with ^(3/3^) parallel along the image of the x-axis and ^(3/3x) parallel 
along the image of the ^-axis. 

Now for such vector fields we have 


XX(Y, Y)(p) = 2X(V x Y, Y)(p) = IWxl Y){p) + 2(V^F, V x Y)(p) 

— y)(p) 

YY(X, X)(p) = 2(V r V r X, X)(p) 

-2XY(X, Y)(p) = -2X({V Y X, Y) + {X, VyY))(p) 

=-2{V x VyX, Y)(p)~ 2 {V y X, V x Y)(p) 

- 2(V x X, VyY)(p)- 2(1 ， V x VyY)(p) 

=^(VxVyX, Y)(p) - 2(1 ， V x VyY)(p). 


So the sum is 

2 [(V X V X Y,Y)- {VxVyX^Y)] (p) + 2 [{V Y V Y X, X)- {V x V y Y,X)] (p) 
= 0 + 2 [{VyVxY^X)- (VxVyY^X)] (p) 

since VyX — Vj^F = [X, F] — 0 everywhere 
= 2{R(Y P ,X P )Y P ,X P ) 

= -2{R{X p ,Y p )Y p ,X p ). ❖ 



CHAPTER 7 

THE REPERE MOBILE 
(THE MOVING FRAME) 


T he previous chapter betrayed the historical development which we have 
been following, for the V operator did not appear until very late in the 
game, around 1954 .* In the meantime, Elie Cartan had elaborated a completely 
different theory, the method of the repere mobile. Despite the fact that this 
theory was invented soon after the Ricci calculus, some of its features are most 
easily understood by referring to the V operator which historically came so 
much later. 

Roughly speaking, the relationship between the results of Chapter 5 and those 
of Chapter 6 can be characterized as follows. When working with V operators 
we express results in terms of arbitrary vector fields, while in the classical theory 
we always use the vector fields X t = d/dx l given by a coordinate system (x, U). 
At each point p e U, these vector fields provide us with an ordered basis 

{X x {p\..^X n {p)) for M p . 

In general, an ordered basis (Ui, • • •, v„) for a vector space V will also be called 
a frame in V. Now the vector fields Xi may be used to determine a function 

P ^ (X x (p),...,X n (p)), 

whose values are frames in the various tangent spaces M p \ such a function is 
called a moving frame. Conversely, every moving frame p F p determines 
vector fields X\,. .. ,X n by F p = {X\ (p), • • • ， X„(p)). Consequently, as a matter 
of convenience we will not distinguish between everywhere linearly independent 
vector fields Xi , •…， X n , and the moving frame they determine. In E. Cartan’s 
theory; the basic idea, whose ramifications turn out to be extremely significant, is 
to express everything in terms of an arbitrary moving frame X\,..,, X n , and not 
just in terms of the “natural moving frame” X/ = d/dx l given by a coordinate 
system .y. 

Notice that a moving frame X\,.,. ,X n need not be the natural frame for any 
coordinate system, since we need not have [Xi,Xj] = 0. Also, a moving frame 


*Nomizu ，Invariant affine connections on homogeneous spaces, Amer. J. Math. 76 (1954) ， 33—65. 
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may exist on a region which cannot even be included in a coordinate system. For 
example, there is a moving frame X \, X 2 on the whole torus. On a Riemannian 



manifold (M, ( , )) we define an orthonormal moving frame X\^... ^X n to be 
one such that each (X\ (/?),..., X n {p)) is an orthonormal frame for M p . The 
frame illustrated above, on the torus, is an example. An arbitrary moving frame 
X\,... ,X n gives rise to an orthonormal moving frame if we apply the Gram- 
Schmidt orthonormalization process to it (Problem 1.9-11). 

Given a moving frame A"i,..., on (an open subset U of) M ， we now ask if it 
is possible to describe quantitatively just how the frame is moving. As a guide, we 
first consider a moving frame X\,..., X n in M w . In this case, we can, with a little 
abuse of notation, consider each X( as an M w -valued function A7 : M w ^ M w . If 
we consider the identity map as an M w -valued function P: M w ^ M w , then the 
M w -valued 1-form dP is just 

dP(X a ) ^ X. 

Consequently, if we introduce 1-forms 9 l by 

n n 

(I) dP = • Xi i.e., dP(X a ) = J2 ei ( x a)' A7 ⑷ e M' 

/ =1 / =i 

then 9 l are just the “dual forms” to the A}，that is, 6 l {Xj) = 8 l j. We also 
introduce 1-forms ⑴ j by 

n 

(II) dXj = 

/ =1 

Clearly o) l j{X a ) is the A7 component of dXj{X a ). Since dXj{X a ) is just the 
directional derivative of Xj in the direction X a , we can interpret o) l j{X a ) as the 
rate at which Xj rotates toward A7(a) as we move along a curve with tangent vector X a . 

Now it is not possible to find a moving frame with arbitrary 0 l and a) l j ； certain 
integrability conditions must be satisfied: 
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1. PROPOSITION. The 1-forms 9 f and co l j for a moving frame X\,... ,X n 
in satisfy the structural equations of Euclidean space: 


dO^ = 〉: 9^ a = — 〉: A 0 
k k 

d(i)j — — 〉: 0)^ A (Oj. 


k 


k 


PROOF. Noting that the equation dP = can also be written dP — 

0 l A Xi (for the -valued 0-form X{), we have 


0 = d 2 P = d : A = 〉 : d9 i A Xi — 〉: 9^ a dX^ 


vr 


k 


Y^ d9i Xi — Y^^y^Xi by (II). 


k 


Setting the coefficient of each X\ equal to 0, we obtain the first structural equa¬ 
tion. 

We also have 

0 = d 2 Xj = do)jXi — (Oj A dXk 

i k 

~ XI da)l j Xt - XX a 石， 

i k i 


from which we immediately deduce the second structural equation. ♦ 


The structural equations can be written much more compactly if we use 
slightly modified matrix notation. Henceforth, we will write matrices a.s A = 
(^4y), and define 

( A ， B yj=TA B h 

k 

so that our new A l k corresponds to the old If v = (ui,..., u w ) is an ordered 
«-tuple of vectors, and we define 


= 




Vi, 
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then the w-tuple w = (w\, • • • ， w n ) will be denoted by v • ^4. We put the A on 
the right because we have 

[v • (^4 • B)]j = • B))vt = A k B j^i 

i i k 

== ， A 、 k 

k ^ i ’ k 

=[(V - A) • B]j, 
which we can write simply as* 

\ • (AB) = (\ A) • B. 

Naturally, if X = (X\, •. • ， X n ) is an w-tuple of vector fields on a manifold M ， 

争 

and A = (A l j) is a matrix of functions (or a matrix-valued function, whichever 
way you prefer to look at it), then X - A denotes the w-tuple of vector fields 

(X^) y =E^y^ 

We extend this notation to forms in the natural way. If ⑴ = (o)j) and r] = (";) 
are matrices of forms, of degree k and / ， respectively, then co A r) is the matrix 
of (k + /)-forms 

( ⑴八 "))= 八 r]j. 

k 

If 6 denotes the column vector of /-forms 0 1 ,..., 0 n , then A 0 is the column 
vector of (k + /)-forms 

(a) A oy a 6 j \ 

j 

With these conventions we can write the structural equations of Euclidean space 
as 

d6 = —co a 9 

da) = —co A (O, 

Henceforth, we will use this notation whenever convenient; for a while the 
reader may feel more secure rewriting things in standard form. Our very next 
result justifies our characterization of the structural equations as “integrability 
conditions”. 

* We express this by saying that the n x n matrices act on the right on the set of all 
«-tuples of vectors of K. On the other hand，suppose we choose a fixed basis ,..., 

for V, and then define A - v for v € V by defining A • Vj = AjVi and extending by 
linearity. In this case we will have (A • B) • v = A • (B • v)： so we say that the n x n 
matrices act on the left on V. 
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2. PROPOSITION. Let co = (⑴ ;•）be a matrix of 1-forms on R w which satisfy 
the second structural equations 

da) — —co a co i.e., dco) = — (Q l k A (of. 

k 

Then, 

(1) In a neighborhood of 0 there is a matrix A = (A l j) of functions, with arbi¬ 
trary initial condition 乂 (0)， such that 

dA = —ay a A i.e .， dA) = - y^a) l k A^• 

k 

(2) In a neighborhood of 0 there is a moving frame X\,... ,X n with arbitrary 
initial conditions X\ (0), •.. ， X n (0), so that the dual 1-forms 9 l satisfy the first 
structural equation 

dO — —co a 0 i.e., d9^ = — > : a 9^. 

k 

PROOF. (1) Let 少 1 ，…，尸 be the standard coordinate system on M 71 ，and let 
y x , y n , z l j be the standard coordinate system on Let Z be the matrix 

of functions Z = (zj) and consider the matrix of 1 -forms 

(*) A = dZ (a) A Z). 

We have 

dA = d(o a Z — (o a dZ 

=~(co A co) A Z — co A [A — (co A Z)] by (*) and the hypothesis 
=—CO A A. 

By Proposition 1.7-14, the dimensional distribution 

n 

A p= f] ker 
i,j=\ 

in is integrable. Since △(o， Zo ) = (j， 1 ，• • • ，少 rt )-plane，the integral manifold 

though any point (0, z 0 ) is locally the graph of a function p i-^ A(p) e M w with 
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yl(0) = zo- Since dZ + ( ⑴八 Z) = 0 on this graph, we conclude, as in the proof 
of Theorem 1.10-17, that 

( 氺 dA = —co a A . 

(2) Choose A to be non-singular at 0, and define 1-forms 9 l by 

j 

which we can write simply as 

9 = A a dy. 


Then 

dd = dA a dy 

— a A A dy by (**) 

= — O ) a 9. 

« • 

So we just define the moving frame X\, ..., X n by 0 l {Xj) = Sj. 


For orthonormal moving frames we have one more relation ： 

3. PROPOSITION. The forms a>j for an orthonormal moving frame X\,..., 
X n in M. n satisfy 

⑴ 1 j = 一⑴ / ， 

i.e., the matrix a> is skew-symmetric. 

PROOF. We have 


0 = d( 〈 A ， Xj)) = (dX h Xj) + (X h dXj). 

Here (dXi, X]){X a ) means {dXi{X a ), Xj{a)). Since the are orthogonal, 
clearly {dXi, Xj){X a ) is just the Xj component of dXi{X a ). This means that 

=出/. ♦ 

Now consider a moving frame X\,..., X n on a manifold M. We can still 

* * * • 
define the dual 1-forms 9 l by 9 l (Xj) — Sj; equation (I) can now be rewritten as 

X q = Yii for any X q e M q , or simply dP = Yli^ 1 ' where 

“dP” denotes the identity map of a tangent space into itself. We cannot use 
equation (II) to define the forms (o l j, since “dXY’ makes no sense on a general 
manifold. However, Propositions 1 and 3 suggest a way of defining these forms 
on a Riemannian manifold. 
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4. PROPOSITION. Let ， ... ， be a moving frame on a manifold M, and 
let 9 l be the dual 1-forms. Then there exist unique 1-forms (o l j such that 

(a) co l j = —coj 

(b) de l =： 

k 

PROOF. Suppose ⑴ satisfy (a) and (b). There are unique functions a l j k and 

仏 with 

⑴） = HX〆 ， 

k 

~ \ Ne k ， b、 k = ~b l k j. 

j，k 

Then (a) is equivalent to 

( a， ) a ik = - ai jk, 

while (b) gives 

lH b jk 9j A0k =d0i = H° J Aa) J = Yl a jk eJ A0k ^ 

j,k j j,k 

and hence 

0 3 ’） a jk ~ a kj = ^jk- 

Cyclically permuting z, y, k, we obtain from (a’）and (b ’） 

a jk = \%k + b ki ~ b U^ 

This proves uniqueness. 

Now suppose we define o)j by 

⑴ ) = 12 a jk 0k ^ 

k 

where the a l j k are as defined above, and hence satisfy (b^. It is easy to check 
that equation (a’) holds, and hence that equation (a) holds. Moreover, from 
equation (b’）we have 

⑴ ;• ⑹ - (D l k {X^)^a) k -a l kl =b) k '. 
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it follows that 

Y^o 1 ^(o\(x h x k ) = jy ( ⑽瓜 ) -e ! ( 阳⑴ ㈣) 

l I 

= Y J ^ l {X k )-& l k <o\{X ] ) 

l 

=(o l j(X k ) ~ 4(Xj) 



which is equivalent to equation (b)♦ 


Although we have used Proposition 3 to motivate Proposition 4, the latter re¬ 
sult seems to involve neither a Riemannian metric nor an orthonormal moving 
frame. However the two are, in a sense, really there, since there is a unique Rie¬ 
mannian metric on the domain of the moving frame X\,... ,X n which makes it 
an orthonormal moving frame. Naturally, if we are already given a Riemannian 
metric ( , ) on M, then we will expect the 1-forms o)j to have some significance 
for this metric only when the moving frame is orthonormal with respect to it. 

Let us therefore consider an orthonormal moving frame X\,.. ,,X n on a Rie¬ 
mannian manifold (M, ( , )). The unique Informs (Oj given by Proposition 4 
are called the connection forms for the moving frame X\,..., X n . They satisfy 
0 )j = —axj and the first structural equation, by their very definition. On the 
other hand, there is no reason to expect them to satisfy the second structural 

争 

equation. Recognizing this, we define a matrix of 2-forms Q = (Q)) by 


d(0 = 一 CO A Cl) -|- ^2 


i.e” dco) = 

k 


A aJf 




The 2-forms Qj are called the curvature forms for the orthonormal moving 
frame X\,..., X n . The names “connection forms” and “curvature forms” are 
explained by the very next theorem. Let us set 

n 

v Xi xj = j24 x ^ 

k=\ 

n 

l=\ 

where V is the Levi-Civita connection for ( , ), and R is its curvature tensor. 
We use bold letters T and R to remind ourselves that these are not components 
with respect to a coordinate system; thus, for example, we do not necessarily 
have (but we clearly do have 
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5. THEOREM. Let X\,... ,X n be an orthonormal moving frame on a Rie- 
mannian manifold (M, ( , )) ? and let Q l j be the dual forms, connection 

forms, and curvature forms for this moving frame. Then we have the structural 
equations of (M ,(,〉）： 

dd i = -J^O) i k A0 k 

k 

dco l j = — a) 1 ^ a a)j Qj 

k 

where 

⑴ j = . 沒 ' = ^ l J kl A ^ 1 — ^ 狄 1 jkl 0 k A 6 l . 

k k,l k<l 

PROOF. By the uniqueness part of Proposition 4, we can prove the first struc¬ 
tural equation by showing that if we defined)^ = Y^k ^kj^ k ^ then the ⑴ j do satisfy 
conditions (a) and (b) of that proposition. Now by Corollary 6-7 we have 

0 = X k (X ii Xj) = (Vx.Xi^Xj)-^- (Xi,V Xk Xj} 

= ^ki 

this immediately implies that (o l j = —⑴/， which is condition (a). 

As in the proof of Proposition 4, we have 

J^0 l a co\(Xj,X k ) = co^Xk)- 4(Xj), 

l 

while we also have 

d0 i (X J ,X k ) = Xji^iXk))- XkiO^Xj))- ^{[Xj^Xk]) 

by Theorem 1.7-13 
= 0 - 0 - Q l (V Xj Xk — ^x k Xj) 



this proves condition (b). 

For the second structural equation we expand 

E R/ ^^ = mk,Xi)Xj = v Xk y Xf Xj - - Vu,〆) 


d 0 = ~(o a 9 
do ) = — o ) a a ) Q , 
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to obtain 

= J2^L r ij - Vkj)^ ^ij) - wlj) - 

Comparing with 

_ ■ 

dco) + ( 為， A7) = - Xiico^))- a)}([X k ,Xi]) 

^ ll ^ 

_ ♦ * 

we see that this does indeed equal R ; y ^/ = Qj {X ^, A7) ♦ 

Notice that the results of Theorem 5 can also be written 

二 YA 、 X A Xi 
/ 

R(X ky Xi)Xj = J^Q i J {X k ,X l )X i , 

/ 

If we did not already have the V operator and its curvature tensor, then we 

声 . 

could, and E. Cartan did，use these equations to define R. Since the Q\ hence 

• » 

the a>j , and finally the Qj , all depend on the moving frame, it is then necessary to 
check that the resulting definition of R depends only on the values of A}, A^, X\ 
at a given point. We save until the end of the chapter some remarks about what 
is involved in that. At the moment, we would like to point out that we are 
already in a position to prove the test case, and thus begin to justify the epithet 
“structural equations”. 


6. THEOREM (THE TEST CASE; FOURTH VERSION). Let (A/， 〈， 〉) 
be an /^-dimensional Riemannian manifold for which the curvature tensor R 


is 0. Then M is locally isometric to M. n with its usual Riemannian metric. 
PROOF. Let Y\,.. ,,Y n be an orthonormal moving frame around p e M, and 


let 6 l and a> l j be its dual forms and connection forms. By assumption, we have 


d(Oj = — 〉: (O^ A (Oj'. 


k 
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Step 1. By Proposition 2, in a neighborhood of p there is a matrix A = {A l j) of 
functions with A{p) orthogonal and 


(*) 


dA. — — (o a. A.. 


Let B = A~ x and define 1-forms (f> 1 , which we can also write as 


Q = A 


Step 2. We have 


A. A d<f) — dO — {dA. A 4>) 

= —0) A 0 (a) /\ A A </>、 

= —a) aAa^>-\-{(oaAa^>) 

= 0 . 


by (*) and 

the first structural equation 
by definition of 6 


• • • 

So d<j> = 0, and consequently there are functions x l with <f> 1 = dx l • 

Step 3. We claim that x l . ,x n is the desired coordinate system, i.e.，that the 
natural frame 3/Sx 1 ,..., d/dx n is orthonormal. To prove this, we first note that 
the 1-forms # satisfy ^> l (Yj) = so 



Since the Yj are orthonormal, it suffices to prove that {A l j) is always an or- 

thogonal matrix. This is a consequence of (*) and the fact that a)= ( ⑴ j) is 
skew-symmetric. The argument for this conclusion has already been given on 
page 36: if * denotes the transpose, we note that A - A 1 satisfies the differential 
equation 

d{A - A 1 ) = —(a) A A • A 1 ) — (A • (co A A) 1 ) by (*) 

= —(p) A A • — (A - A} a co^) 

= -(x)A{A- A l ) + (A . A 1 ) A CO, 

with the initial condition A - >1*(0) = /. By uniqueness, the solution A ' A 1 — I 
everywhere. ❖ 
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We leave it to the reader to correlate the three Steps in this proof with those in 
previous sections. Note that the first structural equation involves the symmetry 
of the connection, as shown by the proof of Theorem 5, and that skew-symmetry 

of co is equivalent to the definition of the Christoffel symbols, since it is the 

• ___ 

condition which determines the ⑴ j in Proposition 4. 

In contrast to previous chapters, in this one we are going to give more than 
one proof of the Test Case. Although the proof just given is a natural use 
of the structural equations as integrability conditions, it does not illustrate the 
fundamental principle to be used in the method of the repere mobile, which is 
to choose the moving frame most suitable to the particular problem. When we return to 
the study of surfaces in 3-space，or submanifolds of Riemannian manifolds in 
general, we will see many instances of this principle. In our present setup there 
is one especially important moving frame, the investigation of which will take 
some time. 

Let Xip,..., X np be an orthonormal frame at M p . In a sufficiently small 
neighborhood of p we can define a moving frame X\,..., X n by choosing Xi(q) 
to be the parallel translate of Xi p along the unique geodesic from p to q. The 



moving frame X\,... ,X n is said to be adapted to the frame X \ p ,..., X np . In 
order to explore the properties of this moving frame, it is convenient to introduce 
the map 

M p ^ M 


defined by 


少 (f, = cxp(^ 


(actually, of course, is usually not defined on all of M x M p , but we will 
continue to write <I>: M x M p M for convenience). On M p we introduce 
the coordinate system t x .... ,t n by 0 Xjp) 二 a 1 ， and we use t for the 

standard coordinate system on M; we then let {t,t x , t n ) denote the obvious 
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coordinate system on E x M p . We now begin to describe the dual forms 9 l and 
the connection forms ⑴ j in terms of their pull-backs to M x M p . 


7. PROPOSITION. When we write W and in terms of dt' ， … ， dt n 
and dt, we have 

(D*^* = dt + e f 

= 0)j, 

• • • 

where 0 l and (b l j are 1-forms which do not involve dt. 

PROOF. We can always write 

= fi dt + e l 
⑴ j = g" dt 


it is only necessary to identify the fi and g”. To do this，we fix a 1 ,... and 
consider the geodesic 


Y(s) 



s e (s, s). 


This can be written as y = $ o c, where c\ (—e ， e) M x M p is c(.y)= 

{s ， Yli ai X ip)' So 

y*O l (s) = c*^*O l (s) = , ... ， a n ) dt 

y*a)j(s) = c*^*(o l j(s) = gij(s,a\ ... ， a n ) dt. 


On the other hand, 



e 




k 


Xk(y(s)) 


a' 


« 

which shows that fi = t l . Also 

y*^ [ji ) = 出 )([« 、 ( 咖 ))) = 

this sum vanishes, because Xj is parallel along y, which means that 

0z=W l ： ^^k X J = J2 akVx f< X J = J2 ak J2 T kj X i' ^ 

k k k i 
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In our next result we will look at “partial derivatives” of the 0 l and d) l j. If we 
are given an expression 

= Yl g j dfj ^ 

y=i 

then we will use the symbol 


~di 


for 


I： 


hi 

dt 


dt j 


and we will use similar symbols for the a)j [this definition depends on the par¬ 
ticular coordinate system^, ^ 1 , ..., Z 71 )]. Note that in 


dQ l = V 




a dt } , 


the terms involving dt are precisely 


dt A 


dt 


8. PROPOSITION. We have the following “structural equations in polar co- 
ordinates ”： 


dO l 


dt 

dt 


k 

= W 州 o ^ye l 
k，l 


^'(0,X) =0 for all X e M p 
o)j(0, X) = 0 for all X e M p 


PROOF. By Proposition 7 we have 


(1) ^*{d6 l ) = d(^*d l ) = dt 1 A dt dt A — — + terms not involving dt 

ot 

. . ㈣ . 

(2) ^*(do)j) = d(^*(i>j) — dt A ―― + terms not involving dt. 

(■ 
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On the other hand, by the structural equations (Theorem 5) and Proposition 7 
again，we have 


(3) ^(dd 1 ) = -J2 中 *^4 八 ^0 k 

k 

A (/力 +俨） 
k 

(4) = a (D*^ + a 6 l 


k 


vtr 


- YA 八砷 + \ E( R ; 。 m k dt^e k )A (t l dt + 9 l y 


k 


kj 


Comparing the coefficients of dt in (1) and (3) we obtain 



k 


which gives us the first equation in the theorem. Similarly, from (2) and (4) we 
obtain 

“h e (呦 。 d >) [心八 （ 〆 《- 〆 #)]; 

^ \ 

• ■ 

together with the relation jkl = —R^va ：? this gives the second equation. 

Since A") = p for all X e M py we have 伞 *( 0，；0 = 0， which gives the 
“initial conditions” 

P(0 ， Z) = 0 ， 4(0,1) 


___ _____ • 

9. COROLLARY. The forms 9 l satisfy the second order differential equation 

j,kj 

with the initial conditions 

0(0, X)=0 
j(0, X)=dt\ 

PROOF. Differentiate the first equation in Proposition 8 and substitute in from 
the second. <♦ 
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It should not be hard to see that for all X e M p we have 


(*) 



(UX) 



in this equation d/dt J denotes a tangent vector in R x M p as well as one in M p , 

• 

since we are using the same symbol t } for a coordinate function on M x M p as 
on M p . We have also written O* instead of 0*(1 ， X) ， etc. Equation (*) shows 
that 9 is determined by exp# and 0’ （ 1 ， X). This makes Corollary 9 particularly 
significant，since it determines 9 l (l, X) in terms of the functions o O. As a 
first illustration of this point, we bore ourselves to tears by twice again proving 


10. THEOREM (THE TEST CASE; FIFTH VERSION). Let (M, ( , )) be 
an ^-dimensional Riemannian manifold for which the curvature tensor R is 0. 
Then M is locally isometric to M. n with its usual Riemannian metric. 

PROOF. Step 1. Let X\^... ^X n be the moving frame adapted to an ortho nor¬ 
mal frame X\ p ^... ^ X np for M p . From Proposition 8 we have 


3a)j 

dt 


0 , 


句 (0 ， X) = 0, 


• _ • 

which shows that d)j — 0. This implies that coj = 0, since (*) shows that <!>* is a 
diffeomorphism at (1, X) and 


a )： — 


• • 

•。‘ 


• • 

Step 2. Since ⑴ j = 0, the first structural equation shows that dd l = 0. So 

0 = dQ\x h x k )^ Xjie^Xf,))- Xfcie 1 (Xj)) - e l ([Xj, x k ]) 

= -e i {[x j ,x k \). 

Thus [Xj, X^] = 0 for all j\k, so there is a coordinate system x\... ,x n with 
Xi = d/dx 、 

Step 3. This is the desired coordinate system, since the Xi are obtained from the 
Xi P by parallel translation, and are consequently everywhere orthonormal. 


In this proof it is still possible to separate the argument into the standard three 
steps. But in the next proof everything happens at once. 
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11. THEOREM (THE TEST CASE; SIXTH VERSION). Let (M, ( , )) be 
an «-dimensional Riemannian manifold for which the curvature tensor R is 0. 
Then M is locally isometric to R” with its usual Riemannian metric. 

PROOF. Let Xi ， … ， be the moving frame adapted to an orthonormal frame 
X \ p ^..., X np for M p . From Corollary 9 we have 


d2§l 0, P(o ， x) = o ， ?^(0,X) 


dt 2 




dt 


which implies that 


(/, X) — t dt\ in particular 0^(1, X) — dt 1 


So 




dtJ 


^>*9 l 


(UX) 


(exp 


* 


dtJ 


x 


)) 


W 


(1 ， 幻 


[for convenience we do 
not write 少 *0’ （ 1 ， X)] 


by (*). 


This shows that the value of the vector field Xj at exp X is 


Xj (exp X) = exp* 




Now the vector fields d/dt J on M p are orthonormal with respect to the usual 
Riemannian metric ( , ) on M p ^ so this equation shows that exp: (M p ,(,)) 
—(M，〈 ， >) is an isometry. ♦♦♦ 


Readers may sort out for themselves the vestigial forms in which the three 
Steps appear in this last proof. One thing does seem worth pointing out explicitly. 
In the two closely related proofs of Theorems 10 and 11 we use the device of 
expressing the integrability conditions /? = 0 in terms of the map This is 
roughly equivalent to the method outlined in Problem 1.6-8，where we solve 
a system of partial differential equations in M 71 by reducing them to ordinary 
equations along lines through the origin. 

The proof of Theorem 11, similar to the previous proof as it may be, is par¬ 
ticularly important to us, for the methods used may be generalized to arbitrary 

Riemannian manifolds (M, ( , )). To do this we introduce 1-forms 9 l on M p 
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by 0 1 (^) = O l (\, X). More precisely, ii X e M p and vx G {M p )x, then we 
have a tangent vector 

(0,vx) e (M x M P ) (UX ) 

(recall that the tangent space of the product of two manifolds is isomorphic to 
the direct sum of the tangent spaces of the manifolds), so we may define 

^(vx) ^^(X^vx)] = 少 ’( 1 ， 1 )(( 0 , 吵 )). 

. . 

In particular, we have (leaving out the arguments for 9 l and 6 l ) 



Now define a tensor ( , ) of type (^) on M p by 

n _ 

《 ， ) = 

/ =i 
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Recall (page 194) that for a 2-dimensional subspace W C M q of the tangent 
space of a Riemannian manifold (M， 〈 ，)) we define the sectional curvature 


K(W) as 


K(W )= 


(R(A,B)B,A) 


A，B a basis for W. 


Let p e M be some fixed point. For every X e M p and 2-dimensional subspace 
V C M p , we will let L(X, V) be the sectional curvature K{W), where W C 
M CX px is the parallel translate of V along the geodesic t \-^ exp tX. 


13. COROLLARY (THE CURVATURE DETERMINES THE METRIC). 
Let M and M f be two Riemannian manifolds, and T : M p M’〆，an isometry 
for some p e M and p f e M\ Suppose that L(X, V) — L f (T(X), T{V)) for all 
2-dimensional subspaces V C M p and all sufficiently small X. Then there is an 
isometry from some neighborhood of /? G M to a neighborhood of p f G M f . 

PROOF. Choose an orthonormal frame X\ p ,..,, X np e M p , let X\,... ,X n be 

the adapted moving frame in M, and let X\ ，…， X f n be the moving frame 

in M’ adapted to T{X\ P ),... ,T{X np ). Let O: M x M p M and 巾 ’：M x 

M ! p t M’ be as defined previously, let 6 l and 6 1 be the corresponding forms 

on M x M p and M x M’〆，and let S: M x M p M x M’〆 be S(a, X) — 

• _ 

(a, T{X)). From the definition of R^/ before Theorem 5 we see that 

R’ 州 = 〈 /?( 為， 

The hypotheses of the theorem therefore imply that 

R%. o 少 = 。（少 ’ o S) for all /, j. 

From Proposition 4-12 we deduce that 

RV/ o CD = R fi Jkl O (d> , O S) for all /, 

Now Corollary 9 (and the uniqueness of solutions of differential equations with 
given initial conditions) implies that 

9 ( ^6 fi oS, 

and therefore that _ _ 

P = e fi o t. 

This means that T is an isometry of (M p , ( , )) and (#’〆，<，〉’)• The result 
then follows from Theorem 12. ♦♦♦ 
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This corollary is the main assertion made by Riemann in his Habilitations 
lecture. We have proved a purely local result, but global results have also been 
obtained; see Ambrose, Parallel translation of Riemannian curvature, Annals of Math. 
64 (1956), 337-363. 

The result of Corollary 13 is perhaps not what the reader may have under¬ 
stood by the assertion that “the curvature determines the metric”，for it involves 
the parallel translation in the manifold, and not merely the curvature. Notice, 
however, that any rigorous statement about curvature determining the metric 
must involve a map f: M ^ M’，so that we know what it means to compare 
curvature in M with curvature in M’; in our case the map / is exp〆 o(exp^) _1 . 
In this connection the following rather different question has always seemed to 
me the more interesting one. Suppose we have a diffeomorphism f: M ^ M ! 
such that for every 2-dimensional V e M q we have K(V ) : = K f (MV)); then 
is / an isometry? It is easy to see that as stated this is not true，because any 
diffeomorphism /: M tt ^ R n satisfies the hypothesis (when M tt has its usual 
Riemannian metric)，but not necessarily the conclusion. It is also easy to obtain 
other examples. Consider the sphere S n C M tt+1 of radius a, with the induced 
Riemannian metric. For w = 2， we know that K(S 2 P ) = K(p) — 1/a 2 . If 
n > 2 and (9 is a neighborhood of 0 in a 2-dimensional subspace V C M p , 
then exp(0) is isometric to a portion of 5 2 , so we have K(V) = 1 /a 2 for all 
such V. Consequently, once again any diffeomorphism /: S n S n satis¬ 
fies the hypothesis of our question, but not necessarily the conclusion. As we 
shall see in Addendum 2, there are also examples where all K(V) are the same 
negative number. 

In addition to these rather special counterexamples, it is simple to construct 
infinitely many other 2-dimensional examples. If we have a 2-dimensional mani¬ 
fold M such that the sets K = constant give a foliation of M，then we can choose 



/ •• M —> M to be any diffeomorphism which keeps each folium fixed as a set. 
There are also specific classical examples of 2-manifolds M, M ! which are not 
isometric under any map，but for which there is a diffeomorphism f: M —> M f 
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with K\f{p)) = K{p) [see pg. III. 166]. Perhaps these examples are responsi¬ 
ble for the fact that the higher dimensional cases remained unsettled for so long. 
Though it might be natural to assume that this case is even more hopeless, just 
the opposite is true: when n >2, a diffeomorphism which preserves sectional 
curvatures is, roughly speaking, an isometry, except for the special counterexam¬ 
ples mentioned in the previous paragraph. For details the reader is referred to 
R. S. Kulkarni, Curvature and Metric, Annals of Math. 91 (1970) ， 311-331. 

Before we proceed further with the study of moving frames, we pause to note 
that in the 2-dimensional case, Corollary 9 (and Corollary 13 which depends 
on it) have really been available to us since Chapter 3. Recall that we chose 
polar coordinates (p ， 0) on M p , and used them to introduce polar coordinates 
(r,(p) = (/>, 0) ° exp -1 on a neighborhood of p [see page 136]. If we write 

〈，、 =dr 公 dr + G dcp ② dcp, 


and define g on M 2 by 

^ = G o exp o (p ， 0) 一 1 ， 

then we have the following formulas (collected together on page 145): 






( 0 , 0 ) =1 


dp 2 


(p ， 0) = — v^(P ， 0) • K(exp(p, </>)). 


These equations are easily seen to be equivalent to the equations in Corollary 9, 
and can be used the same way. For example, if 尺 = 0 in a neighborhood of 
then 

系參 0 ; 

together with the initial conditions，this shows that yfg = />, so 

〈， )=dr ^ dr r 2 d(p <S> dcp. 


which is exactly the expression in polar coordinates for the usual Riemannian 
metric on JR 2 . Even the w-dimensional Test Case could be proved in this way，by 
considering exp(K) for various 2-dimensional V C M p . However, the equations 
of Corollary 9 are generally most useful in higher dimensions. As an exercise 
in using them, Addendum 1 derives the form of the metric in an n-dimensional 
manifold of constant curvature. 
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Everything which we have done so far in this chapter has involved the Levi- 
Civita connection associated to a metric 〈，〉； moreover, we immediately in¬ 
terpreted the connection and curvature forms in terms of concepts defined in 
previous chapters. But the method of the repere mobile is meant to treat ar¬ 
bitrary connections, and was used to define the curvature tensor before the V 
operator had been invented. For the remainder of this chapter we will be con¬ 
cerned with this independent development of the theory of connections. We 
want to describe any connection in terms of “connection forms” ⑴ j, but we do 

not necessarily want the matrix ⑴ = (o)j) to be skew-symmetric, so we do not 
have Proposition 4 to guide us. We do want to have the equations 

^x k Xj = T<o){X k )X u 

i 

which are a consequence of Theorem 5; these equations can also be written 

vxrYAh 

i 

For convenience, we let X = (X\, ..., X n ) and abbreviate this equation as 
VX = X • a) (recall the notation introduced on page 26Iff.). 

Now consider another moving frame X 7 = X • a. We want to have 

= VX'j = 

— ▽ A i v [recall the formula for V x fY 

7 / + t ； ； and note that X(/) = ( // W] . 

Using our matrix notation, this means that we want 

X ， .o/ = VX，= V(X.a) 

=X * dci -h VX • ci 
=X * da + X •(⑴ • a )， 

so we want 

X • (aa) f ) = (X - d)o) = X’ • ⑴’ =X • (da + (oa) 
for all moving frames X. Thus we want the condition 
(*) a) f = a~ x da + a~ X 0L>a. 
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Hence we are led to the following definition:* 

A (Cartan) connection on a manifold M is an assignment of a matrix 
0 ) = (o)j) of 1-forms to every moving frame X such that equation (*) 

holds between the 1-forms ⑴ j assigned to the moving frame X and the 
1-forms a) /l j assigned to the moving frame X’ = X • a. 

Given a Cart an connection，and a moving frame X = X\,..., X n , the 
1-forms o)j which are assigned to X are called the connection forms for X ， 

and we define the dual forms d l by O l (Xj) = 8 l j. From these forms we can de¬ 
fine all the tensors which arise in Chapters 5 and 6. What follows is an outline 
of such a development of the theory of Cart an connections, independently of 
previous considerations. 

We begin with a simple observation about the consistency of the transforma¬ 
tion laws (*)• Suppose X" = X’ . = (X • a) • = X • (ab) is another moving 

frame, and that a/, a) ff satisfy 

— a~ x da + a~ x o)a 
a) ff = b~ l db + b~ x a) f b. 

Then 

⑴"二 b~ l db + b~ l (a~ l da+ a~ x a)a)b 

=[6 一 1 a 一 1 (da)b + b~ l a^ l a(db)] + b~ x a~ x coab 
= (ab)~ l d(ab) + {ab)~ l (oab. 

This shows that if we are given connection forms for a certain set of moving 
frames whose domains cover M, and the various pairs of connection forms all 
satisfy (*)，then there is a unic.ue Cart an connection that assigns these forms 
to these particular moving frames. In view of this remark, it is very easy to 
determine a connection from a Riemannian metric. 

14. PROPOSITION. On a Riemannian manifold (M, 〈， 〉) there is a unique 
Cartan connection with the property that the connection forms (j)\ for any 


* We are using the term “Cartan connection” as a convenient label, but the reader 
should be warned that in the literature this term is used for a different concept. 



282 


Chapter 7 


orthonormal moving frame X satisfy 

0)j = —d)j 

d9 l = > : 沒 灸 A o)^. 
k 

PROOF. We already know，by Proposition 4, that for any moving frame there 
are unique 1-forms ⑴ j with this property. We just have to check that if X and 
X=X a are orthonormal moving frames, then 

a) f — a~ l da + a~ X (joa. 


In view of uniqueness, we just have to show that if the forms a) l j satisfy the 

conditions of the theorem, and the forms a) fl j are defined by this formula, then 
they satisfy the same conditions ， 

(a) o) fl j = -a/j 

(b) de fi = 气 

k 


Since X and X' are orthonormal，the matrix a is everywhere orthogonal ， 
a ^ a 1 = I, where * denotes the transpose. So if 0 denotes the zero matrix 
we have 

0 = da - a 1 a • da 1 , 


or 


da 1 = —a~ l da • a 1 = —a 1 - da - a 1 . 


Consequently ， 


(1) {a^da) 1 = {^daf — da 1 - a = —a 1 • da • a = —a 1 • da = —a~ x da 

(2) {a^coa) 1 — (^coa) 1 = Wa = = —a~ x a)a. 

Clearly (1) and (2) imply (a). 

To prove (b)，we first note that 

O fi (Xj) = e fi {^{a- x ) k } X f k ^j = (a- 1 )) = p(cr%e k (Xj )， 

so 9 fi = J2k( a ~ l yk^ k ^ or 


a 
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Consequently，* 

⑶ d9 f = (— - da - a -1 ) A 0 + a~ x d9 

—— ci ' dci /\ ci ' 0 _ (a i co 八 0). 

On the other hand, 

(4) o) f A d f — (a~ x da + a~ x coa) A {a~ x 9) 

= a~ x da A a _1 0 + (a -1 w A 0). 

Clearly (3) and (4) imply (b). 

When we pass from the particular connection of Proposition 14 to a general 
Cartan connection, both structural equations need correction terms. If a)j are 

the connection forms for a moving frame X， with dual forms 0’， we define 

* * 

2-forms 0 1 and by 

dO = —co A 0 + 0 i.e .， d9 l = — ^ co l k A 0 k - & l 

k 

dco = —co Aco -\-Q i.e .，dcoj = — ^^co l k A (Oj + Q l j. 

k 

» » 

We call the @ l and Q l j the torsion forms and curvature forms for the moving 
frame X. We now compute the transformation formulas for these forms. 

15. PROPOSITION. If ©〃 and Q fi j are the torsion and connection forms for 
another moving frame X 7 = X - a, then 

0’ = a 1 • 0 
= a -1 ^ ci. 

木 To compute d(a~ l ) we differentiate a - a~ x = / to obtain 

da - a~ l + a - d(a~ 1 ) = 0, 
d(a~ x ) = —a—'daa —、 
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PROOF. We have 


⑴ 

(JL> f 

= a~ l da + a~ l o)a 

⑵ 

dd 

=—CO A 0 + 0 

( 3 ) 

de f 

= —(x) A 0’ + ©’ 

(4) 

dco 

— —(JO A(0 Q 

⑶ 

da/ 

=—0) f A 0) f - 


and，as in the proof of Proposition 14 ， 

( 6 ) e f = a~ l e 

(7) cW f = -a~ l da A a~ x 0 -f a~ l d6. 

So 

(8) d9 f = —a~ x da / \a~ x 6 a~ x {—oj a 0 -f 0) by (2 )， （ 7). 

We also have 

(9) de f ^ -0) f a 0 ， + 0 ， by (3) 

= —(a~ x da + a~ l coa) A a~ l 6 + @ r by (1) ，（ 6) 

= —a~ x da A a~ { 6 — a~ l (⑴八 0) + ©’• 

Comparison of (8) and (9) gives a~ l & = @ / . 

From (1) we obtain 

( 10 ) da> f — [—a~ l daa~ l A da] + (—a 一 1 da a~ x ) A coa 

+ a 一 1 dcoa — a~ x co A da 
= [—a~ l daa~ l A da] + {—a~ l daa~ l ) A oja 

+ a~ x (— 0 ) aco -— a~ x a> A da by (4). 

We also have 

(11) do) f — —(jo f Ao) f -h Q f by (5) 

= —(a~ l da + a~ l coa) A {a~ l da + a~ l coa) + Q f by (1) 
= —(a~ l da A a~ x da) — {a~ l daa~ l A coa) — {a~ l co A da) 
_ {(1 ^ 0) /\ 0)(l) -j- , 

Comparison of (10) and (11) gives a~ l Qa = Q f . <♦ 
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The three relations 

(1) a) f — a~ l da -a~ x a>a 

(2) 0’ =： a~ l @ 

(3) Q f = a~ l Qa 

are precisely what enable us to define 

(1) VY for vector fields Y 

(2) T(X 7 Y) for tangent vectors X, Y 

(3) R(X, Y)Z for tangent vectors X,Y,Z. 

We essentially know this already for the V operator，which we consider first. 
Given a moving frame X = ( 不 ， ..., X n ) with connection forms ⑴〜， we define 

▽ X = X .⑴ A or V x X j =Y,i^)^i] 

and extend this to arbitrary vector fields Y = ^ - Xj by defining 


▽. 


(㈣ =; db j • Xj +b 〗 VXj. 


For another moving frame X’ = X • a with connection forms ⑴ we have 

▽ X ，= X a) f = {\ a) a) f = {\ a)- [a~ l da+a~ l a)a] 

=\ [a • (a~ l da + a~ l coa)] 

~ X ■ do, "h X ■ ((i) ■ (2) — X ■ del "h (X ■ (i)^ ■ Cl 
=X . da + VX - a, 

which shows that the definition in terms of the two moving frames are consistent 
[this equation becomes 

▽( 2 > 沐 ) = E da r^+ 

= ^2da i jX i + ^2a i jVX i . 

« « 

l l 

when written out]. 

We next define 


T(X h X k )^Y, & ^ X h x k)^i 
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and extend to arbitrary tangent vectors X : = J ： j^Xj,Y = J ： k c k X k by lin- 
earity, 

r(X, Y) = jyc k T(Xj ， X k )- 
For the moving frame X’ = X • a we have 

T(X f ^X f v ) = J2 0, W, xf v)X f P 

p 

P l V j k J i 

= J2 a l a vJ2 @i ^ x J^ x ^ Xi 

M * 

= TA ak v T ⑶， 

j，k 

so the definitions in terms of the two moving frames are consistent. The ten¬ 
sor T is clearly alternating, since the 0* are 2-forms. 

Finally, we define 

R(X k ,X l )Xj = J2 Q j( X ^ X ^ 

i 

and extend by linearity. For the moving frame X’ = X - a we have 
R{X f ^X f v )X\ = 

p 

P j\m 、 k l 

=J2 ak A a i Qi jd x o x “ 

j ， k，l 


which again shows consistency. Clearly R is skew-symmetric in the first two 
arguments, since the Qj are 2-forms. 



If we now define 
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^x k x j = J^r i kj x i 

i=l 

n 

T{X h X k ) = Y,Vi 

| = 1 
n 

R(X k ,X { )Xj = ^ Ri Jki^i, 

/=i 

then we have 

T) k =co)(X k ) or ^j=T, r kj 0k 

k 

T jk =&{X h X k ) or & = \^T} k 0 J A 6 k 

M 

= or Q i J = l ~Y / ^ i jkie k A 0 l . 

1 kj 

Consequently，the “structural equations” 

dO 1 = A0 k + & = A0 k + A0 k 

k k j,k 

d(Oj — — (O^ A (Oj + = — 〉: (O^ A (Oj + — 〉: 切 A Ql 

k k 2 kj 

k 

are purely a matter of definition. However，one can now easily reverse the 
computations in the proof of Theorem 5 to show that 

T(X, Y) = V x Y-V y X-[X, Y] 

R{X. Y)Z = Vjr(V r Z) - V Y {V X Z) - V [jr , r] Z, 

thus verifying that T and R are the torsion and curvature tensors for V as 
defined previously. We leave this computation to the reader, as well as the task 
of deriving the general form of the structural equations in polar coordinates. 

Even though the structural equations are merely definitions in this approach ， 
we can derive new relations from them. 
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16. THEOREM. We have the following relations between 0, co, 0, and Q: 

(1) (Bianchi’s first identity) 

d® + = 

(2) (Bianchi’s second identity) 

d^l (co A ^2) 一 (^2 A = 0. 

(Notice that co aQ and Q Aco are not equal up to sign, because Q and co are 
both matrices of forms and the order of matrix multiplication plays a role.) 

PROOF. We have 

0 — d{d6) = d{—co a 0 + 0) 

— _ d(x) /\ 0 (^CO A dO^) -h ^/0 

= — {—(JO AO) + f2)A0 + [a)A (—0) A 0 + 0)] + d® 

— _ ^2 A ^ + 0 -j- ^/0. 

Similarly, 

0 = d(dco) = d(—co a o) - f2) 

—— d(JL) A it) A d(X)) d ^2 

— _ ( _ CO A it) ~h ^2) A it) \o) A ( _ CO A 66) -J- ^2)] d^l 
— _ ^2 A 66) ~h (CO A ^2) ~h ^ 

It takes quite a bit of calculation to convince oneself that the equations in 
Theorem 16 really are the Bianchi identities. This calculation is left to those 
readers who have more endurance than the author; we will merely consider the 
special case of the first Bianchi identity for a connection without torsion. In this 
case 0 = 0, so the identity is just A 0 = 0. Thus we have 

0 = (f2 A 6) 1 — 〉 : M A 6^ = — 〉 : R’y 灸 /0 灸 A 0^ A 0 J ' . 

j 1 j ， k，l 

Applying this to (A^, A/, Xj) we obtain the familiar formula 

0 = ^ R， '^ - Ri 仲 + R V-RW + R^-RV} 

— ^ l jki + R ^/； + R’ / 从， 

(and thereby see why cyclic permutations of the indices should be involved). 
It turns out that taking the exterior derivative of the Bianchi identities does 
not give us any new relations, which indicates that these identities are the only 
general ones we should expect to find. 

To complete the present chapter we also derive the relations satisfied by the 
curvature tensor for the Levi-Civita connection. 
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17. THEOREM. Let (Af, ( , )) be a Riemannian manifold, and consider the 
unique Cartan connection of Proposition 14. Then for every orthonormal mov¬ 
ing frame we have 

⑴ Qj = 一 Q/. 

Consequently, the curvature tensor R satisfies 

(2) (R(X, Y)Z, W) = -{R(X, V)W,Z) 

(3) (R(X, Y)Z,W) = (R(Z, W)X,Y). 

PROOF. Equation (1) is immediate from the fact that a>j = —a)j (by assumption) 
and the second structural equation, 

d(i>j = — 〉: 0)^ A 0)j + Qj. 
k 


Since 


{RiX^X^Xj^Xi)^ 


I^R^juX^Xt 


=^ l jkl = n l j(Xk ， X “， 


equation (1) implies (2). Then，as before, (3) follows from Proposition 4-11. 
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ADDENDUM 1 

MANIFOLDS OF CONSTANT CURVATURE 


A Riemannian manifold (M, ( , )) has constant curvature A^o if for all p e 
Af ，and all 2-dimensional V C M p we have K{V) = A^o. If M is 2-dimen- 
sional, this just means that K(p) = Kq for all /? G A/. In this case, we can easily 
find the form of the metric ( , ) from the considerations on page 145. We use 
geodesic polar coordinates (r^cp) = (p,0) o exp -1 so that the metric is 

(,)=dr ^ dr G d(p ^ dip. 


If g on M 2 is defined by 发 =G o exp o (p,0) -1 , then we have seen that 


V^(O,0) = O 


K/l 

^p~ 


( 0 , 0 ) = 1 


= — 欠 OV^(/O,0). 

The general solution of the last equation is 


^/g(p, <f>) = c\ sin v^o p-\~c 2 cos 

y/g{p,<t>) = C\ sinh \/—Ko p + C 2 cosh \J —K。p 


Taking into account the initial conditions, we find that 

s {/ \K^\p) 


欠 o > 0 
Aq < 0. 


where 


s denotes 


sin whenever we are dealing with A^o > 0 

” K 0 < 0. 


sinh 


It follows that the metric is given by 


(,)=dr ^ dr -\- 


s 2 (vW 


vW 


d(p 0 dip. 


To obtain the analogous results in the ai- dimensional case, we will return 
to the equations of Corollary 9. However, we will need a preliminary result. 
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which applies to spaces of constant curvature as a special case. A point p in 
an «-dimensional Riemannian manifold (M ， 〈 , }) is called isotropic if the sec¬ 
tional curvatures K(V) for all 2-dimensional V C M p have the same value A^o ； 
this means that for all X,Y € M p we have 

(R(X,Y)Y 7 X) = K 0 ^\\X y Y\\\ 

where || X y F|| is the area of the parallelogram spanned by X and Y. Applying 
the formula on pg. 1.308 (to the subspace of M p spanned by X and Y) y we can 
write this equation as 

Y)Y,X) = K 0 [(X,X)(Y,Y)-{X, Y) 2 ]. 

18. LEMMA. If p € M is isotropic, with all sectional curvatures equal to A^o, 
then for all X y Y y Z,W £ M p we have 

(R(X 7 Y)Z,W) = Ko[{X, W)(Y,Z)- (X,Z){Y,W)l 

PROOF. Denote the right side of this equation by (R(X, Y y Z, W). By hypoth¬ 
esis, (R(X, Y)Y, X) = (R(X, Y, Y y X). It is easy to check that (R has properties 
(1)—(3)，and hence (4), of Proposition 4-11. The desired result now follows from 
Proposition 4-12. ♦♦♦ 

Although not necessary for our calculations, the following consequence of this 
Lemma is a standard result, and is pertinent to the topic of constant curvature 
manifolds. 

19. THEOREM (SCHUR). If M is a connected Riemannian manifold of di¬ 
mension n > 3 and all points of M are isotropic, then M has constant curvature. 

PROOF. By Lemma 18 we have, in a coordinate system x, 

Rhijk 二 K[ghjgik _ ShkSij] 
for some function K on M. Equivalently, 

R h i]k = 一 

Consequently, Ricci’s Lemma (Proposition 5-3) and the relation = 0 imply 
that 


Rh Uk、i 二 - 心 " ). 
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From Bianchi’s identity (Proposition 5-9(3’)）we then obtain 


0 = 為 ( 巧 gik - S kSu) + - + 為 ( 砟如 - S jSil) 


In this identity set h = k and sum over A:, to obtain 


0 


dK 

dx l 


(g 


u 


、 dK , 

ngij) + (ngu - gu) 


dK 

dx i 


dK 


Sij 


dxJ 


tSu 




(n - 2) 


dK 


dxJ 


rSu 


dK 

d? gij 


Since « > 3 we have 



dK 

dK 



W g " _ 

dx， glJ . 


Hence 

^3 尺 v 八 dK 

^ dxJ = ^ g g// W = 

i 

gl} dx l 一 

i 

: <5. 

Choosing m 

=j ^ /, we obtain dK/dx 1 

= 0, for all l. So K is constant. 


To obtain the metric in a space of constant curvature 尺 o, we now consider 
the equations of Corollary 9. By Lemma 18 we have 

故 jki = Xi)Xj,Xi) = K 0 [8kiSij - 8 k j8u], 

which implies that 

R^'/y = -R 7 ；；/ == Kq (j ^ i); all other R l ju = 0. 


So our equations become 

d 2 e l 


(*) 


dt 2 


w {^9 ( - t l 9 k ) 

k 


To solve even these simplified equations requires quite a bit of trickiness. We 
first use the equations in Proposition 8 to obtain 


E 你 


y t i dt i t k t i 

i i，k 


dt 
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and thus 




dt 


t l dt l ， using skew-symmetry of the a) l k = 


Since t l d l (0 7 X) — 0, we have 




and hence 


，氺氺 , 


= J2 titj§i§j = t2 E titJdtidtJ = t iT, tidtl 

V / J Li Li V / 


2 


a —^ * a a 

We next obtain equations for the quantities t l 9 J — t J 6 l . By (*) we have 


d 2 (你 
dt 2 


rt 


Ko ^ t k t i (t k 9 J - t j 6 k ) + t k t j {t k 9 l - i l 9 k ) 


b 


n 


KojyWW - 冲 ) 


k—\ 
2 


K 0 p z (t l e J —t J e 1 )' 


where we have set 


P 


n 




2 


k=l 


Together with the initial conditions 


(1 1 9 ] -^^)(0,^) =0 


d(tW -t J e i ) 

dt 


(0 7 X) =： t 1 'dt J -t ] dt\ 


we obtain 


t l e J -t J e l = 


py/\Ko 
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Recall that for a form r], we use rj 2 to denote the quadratic function X 
rj(X) - r](X). Summing the squares of the equations just derived, we now obtain 


4 厕 ’) — 


2 


P 2 \K^\ 


<j 


s 


2 


{pVlKo\t) 


2 p 2 \K 0 \ 


(t l dt j - t J dt l ) 


2 


hJ 


hJ 


- tWf = - ^ O 7 ') 2 ^ 7 ') 2 + { 1 ] ) 2 { 6 1 ) 2 - 2 〆， w 


IJ 


y^(r f ) 2 (^ 7 ) 2 — t 2 dt l dt 】by ( 林 : 

ij 


hj 


p 2 ^(o k f -t 2 (j2 tkdtk 

k=\ \ k 


2 


Since exp : (M p , 9 k d k ) (M, ( , )) is an isometry, and 9 is the value 
of 9 l when ， = 1， the expression for || || 2 in normal coordinates is obtained by 
seeing what U 妒 、 2 becomes when we set ^ = 1 and t k = x k in the above 
equation. We let r denote what p becomes when we perform this substitution; 
that is, we let 


n 




2 


k- 


We then obtain 


2 


.2 



x k dx k 


2 s 2 


{ViKo\r) 


\K 0 \ 


,2 


> (x l dx } — x } dx l ) 


2 


i<] 


Notice that we have 


x k dx k \ — r 1 ^y^{dx k ) 2 — dx } — x } dx 1 ) 2 : 

\ k ’ k 」 /■<; 
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consequently, we can write 


T(dx k ) 2 - T!^ok 2 -s 2 (/^r)1 y (x i dx J _ X J dx iy 

k L 1 尺 o|r J / <y 


From the Taylor series for sin it is easy to see that the coefficient of the sum 
Yli<j( xi dx 】 —x 】 dx 1 ) 2 is C°° (in fact it is analytic). This formula thus gives 
a direct verification of Riemann’s assertion about the form of || || 2 in normal 
coordinates (page 168). On the other hand, this form of the metric is not the 
one which Riemann mentioned in his lecture (page 159). We will consider this 
metric in Addendum 2. 
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ADDENDUM 2 

CONFORMALLY EQUIVALENT MANIFOLDS 


In a vector space V with a positive definite inner product ( , ) we define the 
angle Z(v, w) between two non-zero vectors v,w € V by 


Z(v 7 w) = arccos 


(^,y) 

IWIK 


It is easy to say when two metrics give the same angle measurements. 


20. LEMMA. Let 〈， ）i and 〈， h be two positive definite inner products 
on V. Then L\ (u, w) = w) for all non-zero u ， u; € V if and only if there 
is a number c > 0 with ( , ) 2 = c •( ， ）卜 

PROOF. If ( ， ）2 = c . 〈 , )i, then clearly w) = ^i(u, w) for all v,w ^0. 

Conversely，suppose w) — Zi(u, w) for all u, w ^ 0. Let v\,... ,v n be 

an orthonormal basis for V with respect to ( ， ） i. Then for i ^ j we have 


— (W ， ) = 0, 


so {vj,Vj) = 0. Define c\ by (Vi ， Vi) = q. Then 


+ vj) 


(W, Vi + Wh 


+ vj) 


V( v fy v i)l y/( v i + v J^ v i + u /)l 迟 

(Vf 7 Vi + Vj) 2 


Ci 


y/{VhVi)ly/{^i + Vj , Vi + Vj ) 2 y/Ci^/Ci +C; 


It follows that Ci = Cj. 




Now a diffeomorphism /: (M\, { ， ） i) — (M 2 , 〈， ） 2 ) between Riemann- 
ian manifolds is called conformal if each f^ p is angle preserving，and the two 
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Riemannian manifolds are then called conformally equivalent. By Lemma 20, 
a diffeomorphism / is conformal if and only if 


< ， >1 =A/*( , > 2 

for some positive function A : M\ R. In particular, if a Riemannian manifold 
(M， 〈，” is conformal to W 1 with its usual Riemannian metric, then around 
each point /? € A/ we can choose a coordinate system x such that gij = 
for some positive function A: M M. Such a coordinate system x is called 

isothermal. 

In 1822 Gauss showed that isothermal coordinates can be found at any point 
of an arbitrary surface. His proof depends on a trick that works only for analytic 
(C w ) manifolds, and uses a little knowledge of complex function theory. It is 
presented in Addendum 1 to Chapter 9 of Volume IV, together with a (much 
more involved) proof that works in the C°° case. For ai > 2 it is not true that 
every 万 -manifold is locally conformally equivalent to M w . But certain important 
Ai-dimensional Riemannian manifolds are. In fact，in his lecture，Riemann states 
that on a manifold of constant curvature Kq there is a coordinate system x with 


n 




2 


i + ¥w ) 2 


or equivalently 



dx l ® dx l 



W ) 2 ] 2 


Thus, x is an isothermal coordinate system. 

It wasn’t very hard for Riemann to guess this，because there is a very standard 
coordinate system for S n which gives this metric (with Kq — 1 ). We consider S n 
as the sphere of radius 1 around the point ( 0 , … ， 0 , 1 )，so that S n is tangent to 
x { 0 } C M w+1 . Let * be the “north pole，，，* = ( 0 , … ， 0 , 2 ) € S 71 . The 
stereographic projection 


a: — ^ R n 


is defined geometrically as follows: for any ^ * in S 71 ，we let o{p) be the 
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point where the line between p and * intersects It is easy to check (see the 

氺 2 




plane section on the right of the diagram) that 


( 1 ) 




V 


2 /? 


圹 +i ’ ’ _ _ ’ 2 -圹 +i 




and that / = cr _1 is given by 


( 2 ) = f(y) ^ 




少 


n 


iE (/) 2 


i + i E (/) 2 


|W) 2 ，i + iE(y) 


2 


1 * • 

If y\...,y n denotes the standard coordinate system on R w , then the x l = y l oa 

give a coordinate system on S n — {*}. 

«+i . . 

Let i : S n R w+1 be the inclusion map, and ( ，》 =(g) the usual 

/ =i 

Riemannian metric on M w+1 , so that i*( , ) is the usual Riemannian metric 
on S n . We find the expression 

n 


/*〈，）= ^ gij dx l <S> dx } 


for i *{ ， ) by writing 


n 


n 


I(gij 。 f 、 dy l 公 dy j = gij dx l <S> dx j 


、j 




/*，•*〈，> = /*〈，> 

«+l n+\ 

p 0 dz i = J2 d f l ^ d f 




/=! j\k = \ 


dy J dy k 
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• • 

and computing the df l /dy J from (2). A forbidding looking computation finally 
coalesces into 


n 


n 


^ (gij 0 f) dy l ® dy } = 


dy l ® dy l 


j 




2 


so that the expression for the metric i*( , ) is 


E 

i=i 


« • 

dx l ® dx l 


+ ;W ) 2 


2 * 


(A neater way of deriving this formula is presented in Volume IV， Chapter 7.) If 
we were dealing with a sphere of curvature Ko, the factor 1/4 would be replaced 
by Ko/4. 

Of course, we still have to check that this formula gives a metric of constant 
curvature Kq when Kq < 0. We will consider，more generally, all metrics on (a 
subset of) W 1 which have the standard coordinate system as isothermal coordi¬ 
nates, and determine just which of these have constant curvature. 

Thus, we consider metrics on (a subset of) W 1 of the form 


gij = 


^U_ 


F nowhere 0. 


Then g 1 ^ — 8 l ^F 2 . We also have 

dg—28ijdF —28" 3 log F 
= 尸 3 9jc^ = ~J~ 2 dx k 


Setting log F = f, and using the formulas on pp. 1.326 and 1. 328, we obtain 


r?. 


-9 / V i t = K 

3jc j 力 dx l 


r b = r h 


-3 / 


( l ^ ；)； 


all other T l - k = 0. 


From the formula on page 214 we obtain 

a 2 / . a 2 / 


jij 


R l n 


jji 


a 2 / 


dx^dxJ dx ! dx l 

3/ a/ 


jjl dxidx’ dx i dx l 


r 料 j 
i, j, l distinct 


5 倍 )2 




= + iJJ distinct 
all other R l )kl = 0. 
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Now by Lemma 18, the metric has constant curvature Kq if and only if 


Rijkl — K Q (g ik gji — gngjk), 


or equivalently 


R l jkl = K Q (y k gj 〖- S\gjk) 


and hence 




-R l 


jji 


K 0 

J2 ^ 


all other R!}kl = 0. 


So the metric has constant curvature Kq if and only if 


Since 


3 2 / 

3 / 3 / 

= 0 


dx^dx l 

dx^ dx l 


3 2 / 

a 2 / 

■ 

r^ij 


dx l dx l 

dx J dx J 


a 2 / 

,a/ 3 / 

i ^ ^ > 

1 

a 2 / 


2 


尺 0 
F 2 




i + / 


F dx^dx l 


for all j, l 


these equations hold if and only if 


( 1 ) 


( 2 ) 


d 2 F 


F . 


d 2 F 


3x , 3x / 


dx^dx l 

d 2 F \ 

dxW ) 


0 


尺 o + 


A / 9F \ 2 




i ^ ；• 


Equation (1) implies that F = G\ + … + G”，where Gj depends only on x 
Using equation (2) for /， / and then for )， /， we obtain 

d 2 Gi d 2 Gj 


dx i dx i dx^ dx^ 


So we must have 


Gi — ext 2 + biXt + at ， 
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for some c. From (2) we then obtain 


ft 


Ko = — b r 2 ). 


There are two important special cases. If we choose c = Ko/4, b[ = 0 ? at - 
l/n we obtain the metric ( , ) = zlgij dx l (g) dx J which Riemann mentions 
with 

gij = 


Ko 

~4 


Ei( xi ) 


2 


2 ' 


Notice that for Kq < 0, this metric is defined only on 


n 


M 


a e R n : < - 4 /[o 卜 


Nevertheless, (Af ， { ， )) is complete. To see this，we compute that the curve y 
defined by 


K 1 ^) 


sinh 


yj^oi 

" T~ 


y/\Ko\ cosh 




2 


y l (0 = o 


> 


is a geodesic through 0, parameterized by arclength，and defined for all t. Since 
the metric { ，〉 is radially symmetric around 0, there are geodesics through 0 
in all directions, which are defined for all t. 

When 尺 o > 0, the same metric is defined on all of but it is not complete ， 
since it is isometric to the w-sphere of radius with a point deleted. Some 
theorems from Chapters 7 and 8 of Volume IV will throw more light on these 
matters. 

The other important case occurs when c = a,* = 0 and b( = Si„, so that the 
metric is 


n 


E 


dx l ® dx l 


(x n ) 


2 


with constant curvature Ko — —1. The 2-dimensional case，in particular, gives 
the nicest example of a non-Euclidean geometry (see Problem 1.9-41). 
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ADDENDUM 3 


E. CARTAN’S TREATMENT 


OF NORMAL COORDINATES 


In this Addendum we use moving frames to prove Riemann’s claims about 
the form of the metric in normal coordinates, and in fact obtain stronger re¬ 
sults; a special case of our result is represented by the form of the metric for 
spaces of constant curvature which we found in Addendum 1. Throughout, 
we will be working with a moving frame X\,. t , X n adapted to an orthonor- 

華 » 

mal basis X\ p ,..., X np for M p ; the forms 0 l , 6 l are as defined previously, and 
x 1 , …， denotes the Riemannian normal coordinate system determined by 

/?”•• ， X n p. 


21. PROPOSITION. The quadratic form || || 2 — ^Zfidx 1 ) 1 can be written as 

a quadratic form in the differentials x r dx s — x s dx r • 

• 

PROOF. Consider the functions A l j k on M x M p which satisfy the equations 


( 1 ) 




dt 2 

^(0, X) = 0 


° 旬 " r + L (R/ 〜 。 少 M;〆〆 


r 




dA 


jfc 


l dt 


( 0 , X) — 0 . 


We claim that 


( 2 ) 


0 Z = t dt^ + 〉: Aj^t^ dt^. 

j，k 


To prove this, we simply note that if we define d l by this formula, then an easy 
calculation shows that the 0 l satisfy the equations and initial conditions of Corol¬ 
lary 9; since the solutions are unique, the result follows. 

Now note, by skew-symmetry of R z 7 ^/ in the last two indices, that A l . k + A l k j 
satisfies a linear second order differential equation, with initial conditions 

(4t + 4 7 )(u) = o 

+ 4i)( 0 ,1) = 0. 
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It follows that 


⑶ 


A )k = — A )cj- 


[The motivation for this proof is the following. The equations in Proposition 8 

■ • 鲁 

clearly imply that 6 l — t dt l = 0 at (r, 0). This means (by Lemma 1.3-2) that 
we can write 9 l as in (2). Then the equations in Corollary 9 imply that the A\ k 
satisfy (1).] 

Next consider the functions B r s 】 k which satisfy the equations 


(4) 


d 2 B 


sjk 


dt 2 

B r sj M ， X) 

dB[ 


sjk ° + stil 

M，/ 

0 




sjk 


dt 


(0, X) = 0. 


We claim that 


(5) 




E 〜 


This is proved by checking that if we define A r . k by (5)，then equations (4) imply 
equations (1). 

Using the skew-symmetry of ft r s jk in j and k and the skew-symmetry of A l . k 
in j and k [equation (3)]，we easily see that 


⑹ 


B 


sjk 


- B ：kj 


Also, using skew-symmetry of f^ r s jk i n r and we see that 


⑺ 


B hk 


—K jk 


We know that the expression for || || 2 = J2j(0 1 ) 2 in the Riemannian normal 

coordinate system x\... ,x n is what ^2i(0 1 ) 2 becomes when we set ? = 1 and 
t l = x l . So by (2) we have 




2 


1 ]( 心 ) 2 + Y.(Y. Ai jk x] dx 


2 


k 


j,k 


E(E 

r \j，k 


A r jk dx r x J dx 


k 
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Now (3) implies that the first triple sum is a quadratic form in the x r dx s — 
X s dx r . The second triple sum can be written 

B r - k x s dx r dx k \ 

tmmd S J K 7 

using (6) and (7)，we see that this can also be written as a quadratic form in the 
x r dx s — x s dx r . ❖ 

/ 

For further developments, see E. Cartan’s Lemons sur la Geometric des Espaces de 
Riemann, pp. 242 ff. 



CHAPTER 8 

CONNECTIONS IN 
PRINCIPAL BUNDLES 


f I 1 he method of moving frames turns out to be surprisingly powerful, but it 
丄 leads us to a definition of a connection which does not have the u invari- 
ance” property of the Koszul definition (although it is a big improvement over 
the classical definition, simply because the transformation law can be stated so 
much more elegantly). At the same time, we should note a certain deficiency 
in our proofs of the Test Case. We certainly have enough of them (six so far), 
and they use the integrability conditions i? = 0 in many different ways. In 
the first proof we use the classical integrability theorem, and in the second and 
third proofs we essentially reprove this theorem. In the fourth proof we use the 
differential form version of the Frobenius Theorem, and in the fifth and sixth 
proofs we use the proof of the integrability theorem outlined in Problem 1.6-8. 
However, in all this time we have never used the distribution formulation of the 
Frobenius Integrability Theorem (1.6-5), although this is the most geometric 
version of all. 

These two phenomena will eventually turn out to be closely related, but for 
the present we will concentrate on the first problem. The main step in the 
solution of this problem was accomplished by Ehresmann* in 1950 . With the 
advantages of hindsight we can reconstruct the solution in a way that makes it 
seem natural and almost obvious. 

It will be helpful to begin by reconsidering the classical and modern def¬ 
initions of vector fields. As we pointed out long ago, the snazziest modern 
definition of tangent vectors is essentially the same as the classical definition, as 
«-tuples of numbers which “transform” according to certain rules. On the other 
hand, the modern definition of a vector field represents a definite improvement 
over the classical definition. Instead of dealing with w-tuples of functions trans¬ 
forming according to certain rules, we make the set of tangent vectors into a 
new manifold, the tangent bundle, and define vector fields to be sections of this 
vector bundle. The idea behind the modern treatment of connections is to ob- 


* Ehresmann, Les connexions infinitesimales dans un espacefibre differentiable, Colloque de Top- 
ologie, Bruxelles (1950), 29—55. 
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tain a bundle whose sections are just the moving frames on M. To do this, we 
will imitate the construction of the tangent bundle in a rather straightforward 
way. 

Recall that a frame w for M p is just an ordered basis w = (wj,..., u n ) for M p . 
Let F(M) denote the set of all frames u for all tangent spaces M p . We call F(M) 
the bundle of frames of M, and define n : F(M) — M to be the map which 
takes a basis u for M p to n(u) = p. If (x, U) is a coordinate system on M and 
G C/, then every frame w = (wi,..., u„) for M p can be written uniquely as 


n 


Uj 






7T(M) 


The matrix (xj(w)) is non-singular, and any non-singular matrix can occur, so 
the map 

u {x l {n{u)),x l -{u)) g M w x GL(«,M) 

is a one-one map from n~ x (U) onto x(U) x GL(«, M). It is easy to see that if 
(y, V) is another coordinate system, then ^o(xn) -1 is C°°, from x^(n~ l (U HK)) 
to n ^)). This means that we can make F(M) into a C 00 manifold 

in such a way that each is a diffeomorphism; with this C°° structure on 
F(M), the map n is clearly C°°. Notice, finally, that we have a C 00 map 
F(M) xGL(/z ， M) — F(M), given by (w, A)\-^ u-A, where {u- A)i = M u j'^ 
we have u • A = A only when A = I, and the set of all w • A for A e GL(«,M) 
is just n~ l (n (u)). 

Although we have called F(M) the “bundle of frames”，it is clearly not a 
vector bundle; each fibre n~ x (p) is diffeomorphic to GL(«,M), rather than to 
some R n . However, F(M) is another special sort of “bundle” which we will 
now define. 


Let M be a C°° manifold, and G a Lie group. A (C°°) principal bundle 
over M ， with group G, is a triple (P, 7r, - ) where 

(1) P is a C°° manifold (the total space of the principal bundle) 

(2) it : P — M is a C°° map (the projection map of the bundle) 
onto M (the base space of the principal bundle), satisfying 

n(u - a) = 7t(a) for all w g P and a e G 

(3) the map • (the action of G) is a C°° map (u,a) \-^ u • a from 
P x G to P with 

u - (ab) — {u • a) • b for all w g P and a,b e G 
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such that the following “local triviality” condition holds: 

For each p e M there is a neighborhood U o( p and a 
diffeomorphism t : n~ x (U) —^ U x G of the form 

t{u) = ( 7 T(W) ， 0 (W)) 

where 0 satisfies <f>(u - a) = <f>{u)a [the latter product being 
the product in G]. 

From the condition n{u - a) = n{a) we see that {u • a : a € G} C (k(u)). 
Using the property <f>(u . a) = <f>{u)a of the map 0 ， we see that we actually 
have {u • a : a e G} = n~ x (k(u)): for, if i; G P satisfies n{v) — n{u), and 
<f){v) = ^>(u)a for a e G, then = <f>(u • a) and n(v) = n{u • a), so v = u • a. 
Notice also that if m • a = m for some u e P, then a = e. 

Each “fibre” 7i~ l (p) of P is clearly diffeomorphic to G. If p = n{u) for 
u e P, and i : n~ x (p) — P is the inclusion, then the image of the tangent 
space (p) u ) is a subspace V u of P u , called the vertical subspace at u\ 

tangent vectors in this subspace are called vertical tangent vectors at u. Clearly 
Y e V u is vertical if and only if 7 r* F = 0 . 



The simplest example of a principal bundle is M x G, with n: M x G ^ M 
the projection on the first factor, and (p,a).b = (p,ab). This is called the trivial 
principal bundle with group G. So far, we have given only one other example of 
a principal bundle, the bundle of frames F(M), with group GL(w, E). However, 
we can use the construction of this bundle to acquire many other examples. If 
n : E ^ M is any C°° vector bundle over M, we can let F(E) be the collection 
of all frames u for the vector space n~ l (p), for all e M; the projection map 
m : F(E) — M takes a frame u for n~ l (p) into zu(u) = p. Consider, in 
particular, the Mobius strip as a 1 -dimensional vector bundle n : E ^ S x over 
5 1 . A frame in a 1-dimensional vector space is just a non-zero vector, so F(E) 
consists of the Mobius strip with the 0 -section deleted. This space is connected 
(cut a paper Mobius strip along the center if you don’t believe it); more generally, 
a vector bundle n ' E — M over a connected space M is orientable if and only 
if F(E) is disconnected. 
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Examples of a different sort are obtained if we begin with a vector bundle 
n •• E M equipped with a Riemannian metric, and define 0 (£) to be the set 
of all orthonormal frames u for n 一 1 (p )，for all p G M. This is a principal bundle 
over M with group 0(n). In the case of a 1 -dimensional bundle n: E — M, 
every fibre of 0 (£) has exactly 2 points, and 0 ( 1 ) = Z 2 ； the action of the 
non-zero element of 0(1) on O(E) interchanges these 2 points in each fibre. 
For the Mobius strip，the principal bundle 0 (£) looks like the picture below. 



More precisely，the total space of O(E) is a circle S 1 ，and the projection map 
S 1 -> S l is given by 6 \-^ 2d. 

In the case of an oriented bundle n \ E -> M, these constructions can be 
modified to give principal bundles SF(£') and SO(£*)，the bundle of positively 
oriented frames, and positively oriented orthonormal frames, respectively. For 
1 -dimensional bundles, the total space SF(£') looks like M x (0, 00 )，while SO(£') 
looks just like M. For 2 -dimensional bundles, SO(£*) has a circle above each 
point of M\ the action of 0 € SO(2) = 5 1 rotates each circle through an angle 
of 6. 

Although examples in which the group G is discrete are the least interesting 
for us, we will nevertheless give two more，as examples of principal bundles are 
hard to come by. In the first example, the group is the integers Z，the total 
space and base space are E and S 1 ， respectively, and the map 丌 ： E S 1 is 



丌 


7 r( 0 ) = (cos 0， sin 0). In the second example, we define n: S 2 IP 2 to be 
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n(p) = {p, —p }； the group is Z 2 and the action of the non-zero element is to 
take p e S 2 to —p. 

Before discussing principal bundles in particular, we need some generalities 
about Lie groups acting on manifolds. Consider a Lie group G, a C°° mani¬ 
fold M, and a C°° map (p,a) p . a from M x G to M. We say that G acts 
on M on the right (via this map) if 

(1) the map R a : M — M defined by R a (p) = p - a is a. difFeomorphism for 
all a e G 

(2) p - (ab) = {p • a) • b for all G Af and a,b e G. 

Condition ( 2 ) can also be written as R a b = Rb ° Ra .， since each R a is a difFeo¬ 
morphism, it follows easily that R e is the identity map of M. We say that G 
acts effectively if e is the only element a with R a the identity map of M, and 
we say that G acts without fixed point if the following stronger condition holds: 
if /?•« = /? for some p e M, then a = e. 

Now let g be the Lie algebra of a Lie group G which acts on M on the right. 
For every X e g, we have the curve t exp?A" in G\ for each p e M this 
gives rise to a curve c p (t) = p - (exp?X) = R C xptx(p)- We denote Cp’(O) by 
o{X){pY we thus have a vector field o{X) on M, and hence a map cr: fl ^ 
(vector fields on M). The 1-parameter group of diffeomorphisms generated by 
o(X) is <t> ( (p) = p - (exp tX), by the very definition of o(X). It is important to 
note that we can also describe o(X) as follows. For p e M, let a p : G ^ M be 
o p {d) = p . a. Then 

o{X){p) = a p ,(X). 

To discuss this operation a we will also need to introduce an important map, 
the “adjoint map” 

Ad(«) = (L a Ra~ 1 )* = (Ra^La)*'- Q — 

where L a and R a now denote left and right translations in G. Thus Ad (a) is the 
differential at e of the map b aba— 1 = L a Ra~ X {b) = R a - 1 L a (b). Usually 
(Ad(«))(A") is denoted simply by Ad(«)X. If X is the left invariant vector field 
on G with X(e) = X e fl, then 

M(a)X = {R a -\{L a J){e) = [(A 一 ” 氺歹]⑷， 

since L a *X — X. Consider, in particular, the special case where G = GL(«, K), 
so that = 3l(«, K) is the set of all n xn matrices. For any n x n matrix N and 
any A e GL(«, K) we have 

Ad ⑷ TV = ANA~\ 

since La* = La and Ra* = Ra, because La and Ra are linear functions 
(compare Problem 1.10-19). 
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1 • PROPOSITION. Let G act on the right on M. Then 

(1) The map a: ^ (vector fields on M) is linear. 

(2) a([X 7 Y]) = [a(X),a(Y)]. 

(3) If G acts effectively and X ^ 0 7 then o(X) is not the zero vector field. 

(4) If G acts without fixed point and X ^ 0, then o(X) is nowhere 0. 

PROOF. Linearity is clear from the equation o{X){p) = a p ^(X). 

To prove (2) we note that since the bracket of two vector fields is the same as 
the Lie derivative (Theorem 1.5-10)，we have 

(1) Y] = [X, Y](e) = hni - [Y - (R exphX ),Y(e)] 

=lim y[Y - Ad(exp -hX)Y]. 
h 一 ^0 h 

(Compare Problem 1.10-19.) 

On the other hand, if R a now denotes the map p \-^ p • a from M to G，then 

(2) R—x 。 o p .^ v - hX ){a) = p - ([exp -hX]a exp hX). 

Since <t> t (p) = p - (exp tX) is the 1 -parameter group of diffeomorphisms gener¬ 
ated by o(X), on M we have 

[a(X) 7 a(Y)](p) = [R exph xho(Y)(p)] 

=Urn ^ [a p ^Y - cr^(Ad(exp -hX)Y)] by ⑵ 

=f lim j[Y- Ad(exp = a p ^([X,Y]) by (1) 

\h—Qh ) 

= o{[X, Y]). 

To prove (3 )， suppose o(X) = 0. Then for every p the 1-parameter group 
of diffeomorphisms (f>t(p) = p - (exp tX) must be (f>t(p) = p (for the unique 
integral curve c of o(X) through p is clearly c{p) = p). If G acts effectively ， 
this means that all exp tX — %o X = 0. 

To prove (4 )， suppose a(X)(p) = 0 for some p. Then p - (exp?X) = p for 
all t. If G acts without fixed point, then all exptX = e 7 so X = 0. ♦♦♦ 
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Let us now apply this construction to a principal bundle 7t •• P — M, with 
group G. The map • from P x G ^ P is an action of G on P to the right (the 
map m 卜 m -a is a diffeomorphism by condition (3) of the definition) and G acts 
without fixed point (we have already pointed out that this follows from condi¬ 
tion (3)). Therefore we have the fundamental vector field o(X) corresponding 
to X for all X e g,; for every u e P, the map X \-^ o{X){u) is an isomorphism, 
since G acts without fixed point. Since the maps R a : P ^ P take fibres to 
themselves, the set of all a(X)(u) is precisely the set of vertical vectors at u. 

2. PROPOSITION. For all X g g and a e G, the vector field (R a ha(X) is 
the fundamental vector field 

(R a )^o(X)^a(Ad(a- 1 )X). 

PROOF. Since = p-exptX = Rexptx(p) is the 1-parameter group of dif¬ 
feomorphism s generated by o(X)^ it follows from Lemma 1.5-11 that 
generates the 1-parameter group of diffeomorphisms 

♦t = Ra ° ^exptX ° Ra — ^a~ l exp tXa - 

Now {a~ l exptXa} is the 1-parameter group of diffeomorphisms of G gener¬ 
ated by Ad(a~ 1 )X. So yfr t generates cr(Ad(a _1 )A")♦ 

The vector fields o(X) are rather difficult to picture, especially since most 
principal bundles themselves are impossible to visualize. Nevertheless they will 
be very important, as we shall see upon returning to the structure which began 
our whole discussion, the principal bundle of frames F(M). A section s of this 
bundle over an open set U C Af is a C°° map s.. U — F(M) with n o s = 
identity map of U. Clearly, a section s is just what we used to call a moving 
frame on U. Note that, unlike a vector bundle, which always has a section 
defined on all of Af，namely the 0-section, a principal bundle need not have 
such a section. In fact, if the principal bundle n : P ^ M over M with group G 
has a section s' M — P, then the bundle is trivial: we can let (/?, a) e M x G 
correspond to s(p) • a e P. 

We can use some of this new language to give an alternative，but com¬ 
pletely equivalent, definition of a Cartan connection: A Cartan connection 
is an assignment of an « x « matrix-valued 1-form a) s [= (^j)] to every section 
s.. U — F(M) in such a way that 

(*) o) s . a = a~ x da + a~ l a) s a 
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for every C°° function a' U — GL(«,M). (Here s • a is the section (i 1 - a){p) = 
s(p). a(p), the - denoting the action of GL(«, M) on F{M)) This formulation 
of the definition suggests how we may obtain a definition of a connection which 
has all the advantages of the Cartan definition but which is also “invariant”. 
We ask if there is an « x « matrix-valued 1 -form co on the manifold F(M) such 
that for each section (moving frame) s we have 

co s = 5*( 出)； 

notice that previously we used co alone to denote the connection form for some 
moving frame, but from now on we will have to be careful to use subscripts to 
distinguish the forms on M from the form co which we hope to find on F(M). 
Since the forms co s for a Cartan connection satisfy (*), our question is then the 
following: 

If we are given a collection of co s satisfying (*)，is there an co on F(M) 
such that each co s = ^*( 0 ;)? More generally, which n x n matrix¬ 
valued 1-forms co on F(M) have the property that for every section 
s ' U — F(M) and GL(«, M)-valued function a on f/ we have 

(**) (*s • a)*(co) = a~ x da + - a? 

In order to answer these questions, we need to know something about the 
section s . a. If a has the constant value A 6 GL(«, M), then s . a = Ra 0 s, 
where Ra(u) u • A, the dot denoting the action of GL(«, M) on F{M). So 
for any tangent vector X p 6 M p we have (i 1 - a)^{X p ) = Ra^{s^X p )\ when a is 
not constant there is a correction term. 

3. PROPOSITION. Let be a section of F(M), over some open set U, and 
let a: U GL(«, M) be C°°. Then for any tangent vector X p at a point p e U 
we have 

(s - a)^{X p ) = R a (p)*(s^X p ) + a(a(pr l - X p (a))(s(p) • a{p)). 

[Note that X p {a) is an « x« matrix, so that a(p)~ l -X p (a) is also an « x« matrix, 
and hence may be considered as an element of M), so a{a{p)~ x - X p {a)) 
is a vector field on F{M)^\ 

PROOF. For convenience, let 

m: F(M) x GL(n,R)^ F(M) 

be m(u. A) — u-A. Remember that the tangent space W of F(M) xGL(«, M) at 
(u. A) is isomorphic to the direct sum 尸 (M) w © GL(«, M)^, so we can consider 
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every element of VF as a pair (Fi, Y 2 ) = K ㊉ h，where Y\ e F{M) U and 
Y 2 G GL(«, M)^. Note that if 

C\ is an integral curve in F{M) for Y\ 

C 2 is an integral curve in Gh(n, M) for Y 2 , 

then 

t i-^ m(c\(t), A) = C\{t) - A is an integral curve for A" ㊉ 0 

t i-^ m(u, c 2 (0) = w - C 2 (t) is an integral curve for 0 ㊉ F. 

Now let c be an integral curve for X p . Since s • a = m o (s, a), we have 

(*s • — /77*(*s*(A^) ， a*(A^)) = w*(*s*(A^) ㊉ a*(A^p)) 


d 

~dt 


s(c(t))a(p) + 


/ =o 


d_ 

dt 


s(p)a(c(t)). 


：o 


The first term on the right can be written as 

d 


dt 




ds(c(t)) 


t—Q 


dt 


/ =0^ 


To identify the second term, recall that for all m g F(M) we have 

a(N)(u) = o u ^(N), where o u {A) = u • A for A e GL(n,R). 

We write 

s(p) - a{c{t)) = s(p) - a(p) • [a(p)~ l - a{c{t))}. 

The term in brackets gives a curve y(t) = a(/?)— 1 • a(c(t)) in GL(«, M) with 
y(0) = /; then }/’(0) e GL(n, M)/ = gl(«, M) is the tangent vector corresponding 
to the matrix 


d_ 

dt 


：o 


a{p)~ X - a{c{t)) = a{p)~ x • ^ 


a{c{t)) = a{p)~ x • ⑷: 


/ =o 


when we identify the n x n matrices with gl(«, M). Consequently, 


d_ 

dt 


s(p)a(c(t)) 


t—0 


d_ 

dt 


s{p) a{p) • [a{p)~ x a{c{t))} 


/ =0 


d 


^s{p)>a{p)* I ^ 




=0 




1 • Xp{u)) 

o{a{p)~ x - 心⑷冲 ⑼ • a{p)). ❖ 
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Remark: Proposition 3 can actually be formulated for any principal bundle 
7t. P — M over M, with group G. If 5 1 is a section over U and a : U — G is 
C°°, then for any tangent vector at g f/ we have 

(s - a)^{X p ) = R a{p)i ,{s i ,X p ) + o {La^-x^iXp)) (s(p). a{p)). 

With Proposition 3 at hand, let us reconsider our question. We want to know 
which n x n matrix-valued 1-forms a) on F(M) have the property that for each 
section s we have 

(**) (5 1 • a)*((o) = a~ x da + a~ x s*{a))a. 

This is equivalent to saying that for every X p g M p we have 

⑴ ((s • a)^X p ) = a~ x {p)X p {a) a~ x {p)a){s i ,X p )a{p). 

According to Proposition 3 this is equivalent to 

(^{Ra{p)^X p )) + a)[o(a{p)~ x - X p {a))(s{p) • a(p)fj 

=a~ l (p)X p (a) a~ l (p)a)(s^X p )a(p). 

We can extract two separate equations from this, as follows: 

(1) First suppose that a(p) = I. Then we obtain 

d)(s^X p ) + (o{o{X p {a)){s{p))) = X p {a) + a)(s^X p ), 
or 

(I) co(a(X p (a))(s(p))) = X p (a) for a{p) - /. 


(2) Now suppose a has the constant value A. Then we obtain 
(II) (d{Ra^{s^X p )) = A~ l (o(s^X p )A. 

Conversely, it is not hard to show that if a> satisfies (I) and (II), then a> satis¬ 
fies (**)• 

Now the equations (I) and (II) can be simplified considerably, resulting in 
equations that do not involve s at all. In (I), the matrix X p (a) can obviously 
be any n x n matrix, since we just have to satisfy a(p) — I. Since we can also 
choose any 5 1 ，we see that (I) is equivalent to 


(10 


a)(a(N)(u)) = N 


for all N G and u G F(M). 
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We claim that (II) is equivalent to 

(II ，） (o(R a *Y) = A~ l (v(Y)A for all Y e F{M) U . 

To see this we note that by choosing s appropriately we can make s^X p be any 
vector in m = s(p) that is not vertical; since the vertical vectors are all of the 
form <y(N )， equation (II’）for vertical vectors follows from (I’）and Proposition 2, 
remembering that for GL(w, R) we have Ad(^4)A^ = ANA~ l . 

Summing up, we see that a matrix-valued 1-form a) on F(M) satisfies 

(s - a)*(a)) — a~ l da + a~ l s*(a))a 

[and consequently the assignment of s*a) to 5 1 is a Cartan connection] if and 
only if 

a)(a(N)) = N for all N e gI(w ， M) 

a)(R A ^Y)) = A~ l a)(Y)A = Ad(A~ l )a}(Y) for all F g F(M) u . 


We leave it to the reader to show that if we are given a Cartan connection 
{o> s }，then there is a unique such a> on F(M) with (o s = (Define 

a)(Y u ) = (o s (X p ) whenever Y u = s^X p , and use Proposition 3 and the trans¬ 
formation rules for a Cartan connection to verify that <o is well-defined.) We 
are consequently ready for the final definition of a connection; since all our con¬ 
ditions make sense in any principal bundle, our new definition is not only more 
abstract, more elegant, and more incomprehensible, but also more general. 

An (Ehresmann) connection in a principal bundle n:P^M over M 

with group G is a C°° g-valued 1-form a) on P such that 


(1) co(a(X)) = X for all ^ g Q 


(2) (o(R a ^Y) = Ad(a~ l )a)(Y) 


for all a e G 7 and all 
tangent vectors Y on E. 


If (o is an Ehresmann connection, then the map (o{u ) : ^ is onto for 

every m g P, by (1), so its kernel H u = kerft>(w) is a subspace of P u having 
the same dimension as M. This subspace is called the horizontal subspace at u 
(determined by the connection), and tangent vectors in H u are called horizontal. 
Thus every Ehresmann connection a) on P gives rise to a certain distribution H 
on P. 
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4. PROPOSITION. If H is the distribution on P determined by an Ehres- 
mann connection then 


(1) P u = H u 

(2) H ua = (Ra)*H u 

(3) // is a C°° distribution. 


Conversely, if // is a distribution on P satisfying (1)—(3)，then H is the distribu¬ 
tion determined by a unique Ehresmann connection a). 


PROOF. Condition (1) is obvious from the definition of H u as ker ⑴ (m) (and the 
fact that o)(u) is onto g). 

If Y g H u , then 


o)(u - a)(R a *Y) = a)(R a ^Y) 

=Ad(a- l )(o(Y) 
— 0 , 


by condition (2) in the 
definition of a connection 


so R a *Y ^ H u . a . Since R a ^ is one-one, and dim H u = dim H u . a ^ it follows that 
Hu.a = (及 a)* 只 

To prove that // is a C°° distribution, choose vector fields Fi,..., ..., 

which ^pan P v for all d in a neighborhood of u. Let ， . • • ， be a basis for Q ， 

so that we can write a> = (x)^ - Xj for ordinary C°° 1-forms (jl> } on P. Let F/ 

be the vector field 

j 

The C°° vector fields F/ are clearly horizontal and span the distribution H in 
a neighborhood of u. 

Conversely, given H, we (must) define a> by (jo(Y) = 0 for Y horizontal and 
a)(a(X)) — X for X e g,. Then a) is C °° ， since o)(Y) is C°° when F is a 
horizontal vector field, or when Y = o(X) 7 and these vector fields span the set 
of all vector fields，over the C°° functions. Condition (1) for a connection holds 
by definition of a). To prove condition (2)，we need only prove it for horizontal Y 
and vertical Y. If Y is horizontal, then is also, by condition (2) on i/, 

so we have 

⑴ ((D*F) = 0 = Ad(a - b ⑴ (F). 

When Y is vertical, we may assume that Y = o(X) for X e g,. Then (D*F = 
cr(Ad(a-*)1) by Proposition 2. So 

co((R a ^Y) = Ad(a~ l )X = Ad(a~ l )a)(Y) 7 


as desired♦ 
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Often, a connection is defined to be a distribution H satisfying (1)-(3) of Propo¬ 
sition 4, and a> is defined as in the second part of the proof—it is then called 
the “connection form” for the connection H. Using the decomposition given 
by (1) of Proposition 4, we can write, for any tangent vector Y G P u , a unique 
expression 

Y = v(Y)+h(Y) 

where v{Y), the vertical component of Y, is vertical and h(Y) 7 the horizontal 
component of F, is horizontal. As we noted in the proof of Proposition 4 ， 

h(Y) = Y-J2<o J (Y)a(Xj). 

j 

From this formula it is clear that h(Y) 7 and hence u(F), is C°° if Y is C°°. 

From the decomposition / ^ ㊉ H u and the fact that V u is the kernel of 

兀 *. Pu — it is also clear that 7r* : H u — M n ( u ) is an isomorphism for 

each u e P. Consequently, for every vector field X on M there is a unique 
vector field X* on P such that X* is everywhere horizontal and 7r*(A"* w )= 
X n ( u ) for all m g P; this vector field X* is called the lift of X. There are two 
simple propositions about lifts. 

5. PROPOSITION. If A" is a C°° vector field on Af, then X* is a C°° vector 

field on P, and for all a G G we have = X*. Conversely, if F is a 

horizontal vector field on P such that = Y for all a G G, then Y = X* 

for a unique vector field X on M. 

PROOF. Using local triviality, we can choose a C°° vector field X 1 on some 
n-\U) C P such that n^{X f u ) = X n(u) for all w e n~ x {U). Then X* = h{X f ) 
is also C°°. The other parts are left to the reader. ♦ 

6. PROPOSITION. If X* and Y* are the lifts of vector fields X and Y on M, 
then 

(1) X* + V* is the lift of X-hY 

(2) for every /: M —^ M we have {fX)* = (/ o 7 r) • A"* 

(3) h([X*, Y*]) = [X, Y]*. 

PROOF. The first two parts are trivial. For the third, we note that X* and X 
are 7T-related, so by Proposition 1.6-3, 

7t,(h[X\ r*] w ) = 7 t^[X\ Y*] u ) = [X 7 Y] n(u) . ❖ 
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Our aim now is to develop, and generalize, the material of the last three 
chapters, from the point of view of Ehresmann connections. The topics to be 
discussed include parallel translation, covariant derivatives, the curvature and 
torsion tensors, the structural equations, and the Bianchi identities. Presumably, 
somewhere along the way an elegant proof of the Test Case will also turn up. 
Be prepared for some unappetizing complexity — this is the price of invariance! 
One problem is that all the interesting information about a connection resides 
in the horizontal subspaces, but formulas about forms on a principal bundle 
also have to reckon with the vertical vectors [which is what clause (1) in the 
definition of an Ehresmann connection is designed to do]. 

We begin with a discussion of parallel translation. A piecewise C 1 curve 
y ： [0,1] —> P is called horizontal if all tangent vectors and y f {t~) are 

horizontal vectors. Now if c : [0,1] —> M is piecewise C 1 , we define a lift of c 
to be a horizontal curve c*: [0,1] P such that c* covers c, i.e., such that 
n oc* = c. Notice that if c* is a lift of c, then so is R a o by Proposition 4(2). 



7. PROPOSITION. Let c: [0,1] —> M be a piecewise C 1 curve, and choose 
Wo € P with n(uo) = ^(0). Then there is a unique lift c* of c with c*(0) = Mo. 

PROOF. Using local triviality of the principal bundle, it is easy to show that 
there is a curve y: [0,1] —> P with y(0) = Mo and n o y = c. A lift c* of c 
must be of the form c^{t) = y(t) . a(t) for some «: [0,1] —> G with a ⑼ =e. 
Using the method of proof for Proposition 3 (see also the Remark following it), 
we find that 

R a{th (y f (t))a (L a{t) -i^a f (t)) (c*{t)). 

Consequently, 

= Ad(a(ty x )a)(y f (t)) + ⑴. 
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Now c* is horizontal if and only if co(c* f (t)) = 0; using the above equation, and 
remembering the definition of Ad, this means that 

(0 = 一 ⑴- 1 * 及⑴ ( 〆 （’))， 

a (’）= — R a (t)*^{y (0)- 

In this equation, (o{y f {t)) is a given curve in If we introduce a coordinate 
system on G, this equation becomes a differential equation for a, and we know 
that a unique solution exists locally. So we can always find a lift c* defined in 
a neighborhood of any t e [0,1], with c*(t) == u for any given u e n~ l {c{t))\ 
moreover, this lift is unique. 

We now have to show that the lift can be defined on all of [0,1]; clearly 
we just have to show that a lift on [0,/o) can be extended past to. To do this, 
pick a lift c* defined in a neighborhood of to (with any old initial condition 
c*(/o)). Choose t\ < to so that c* is defined at t\ , and then choose a G G with 
c*(h) = c*(h) - a. Clearly we can extend c* past to by letting it be R a o c*. 

Notice that the last part of this argument is just that used in the proof of 
Proposition 1.5-17. The first part of the argument is much simpler when we 
are looking for a lift in a neighborhood of a point t with c f {t) ^ 0. For then (a 
portion of) c is the integral curve of a vector field X on M, and c* is just an 
integral curve of the lift X*. 

Using Proposition 7, we can now define parallel translation of fibres of the 
principal bundle n \ P M along any curve c: [0,1] M. For any u e 
we let T t (u) e n~ l {c{t)) be c*(t), where c* is the lift of c with c*(0)= 
u. In this way we obtain a map 

T t : tt _1 (c(0)) — n~ l {c{t)). 

It is clear that z t o R a ^ R a o r r , since R a 0 c* is again a lift. The map x t is a 
difFeomorphism whose inverse is just the parallel translation along the reversed 
portion of c from t to 0. 

Consider, in particular, the principal bundle of frames n : F(M) M. Every 
frame u e F{M) determines an isomorphism from M. n to namely, we 

send et e W 1 to ui e M n ( u ). This isomorphism will be denoted by the same 
letter, M n ( u ). It is easy to check that for ^ G we have 

(w • a)(^) = u(a - ^), 

where the product a • ^ of an « x « matrix a and a vector ^ eW 1 is defined in 
the footnote on page 262. Now suppose c: [0,1] M is a piecewise C 1 curve, 
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X p € M p is a tangent vector at ^ = c(0), and c* : [0,1] ^ F{M) is a lift of c 
with c*(0) = w. There is a unique ^ with c*(0)( 专） =m(^) = X p \ we let 

thus defining parallel translation of vectors. To check that this parallel trans¬ 
lation is well-defined, we consider any other lift, which must be of the form 
c* = R a oc*. Then 

X p = c*(0m = c*(0)-a(a~ l 
^c^OKa- 1 ^), 

so for the parallel translation x t defined with respect to c* we have 

r t (X P ) = P ⑴ (a— 1 •夸卜 c*(0 • a(a- ! ⑴⑻ 

If we choose c*(0) = w so that m((1,0, ... ,0)) = X p , we see that we can parallel 
translate X p by making it the first vector in a basis, parallel translating the basis, 
and then taking the first vector in the translated basis. It is also easy to see that x t 
is a vector space isomorphism. 

Having defined parallel translation of vectors, we can now define covariant 
differentiation of vector fields by the formula on page 234, 

(*) v^r = 

h~^0 n 

where c(0) = p and c’(0) = X p . It is not yet clear that this covariant differenti¬ 
ation is the same as the one we obtained on page 285 from the corresponding 
Cartan connection to prove this we will need a Lemma that is also 

used frequently later on. Given the vector field Y, we consider the function 
/y: F(M) R n whose value at a frame v is just the set of components of 
Y n ^ v ) with resect to f —-in symbols, 

f Y (v) = v~ l (Y n{v) ). 

8. LEMMA. Let F be a vector field on M, and let X p € M p . Then for any 
u € F(M) with p = n(u) we have 

V Xp Y = u{X\{f Y )\ 

where X* u e P u is the unique horizontal vector with n^(X* u ) = X p [notice that 
since /: P E w , the value X* u (fy) of the vector X* u on fy is an element of 
® w ，so uiX^^fy)) € M p makes sense]. 



Connections in Principal Bundles 321 

PROOF. Let c be a curve with c(0) =： p and c^O) = X p , and let c* be the lift 
of c with c*(0) = u, so that c*’(0) =： X* u . Recall the definition of the parallel 
translation of ^c{h) along the reversed part of c from 0 to /r. we 

choose ^ g M w with 

c\hm = Y c{h) or c\hy\Y c{h) ) = H 

and then define 

r h~ l {y c (h)) - ⑼ (?) = w(^). 

Consequently, 

uoc%hy\Y c{h) ) = r h -\Y c{h) ), 

So we have 

V x J=\\mJ-[x y -\Y c{h) )-Y p ] 

=lim j [w 。 c*{h)~ l (Y c (h)) 一 u o u_ l {Y p )] 
h—oh L J 


9. PROPOSITION. The V defined by (*) is a Koszul connection; that is, 

(1) v w; r = v^r + v^ ; r 

(2) V Xp (Y^Y 2 ) = v^r, + V Xp Y 2 

(3) ^aX p Y = aVj^F for all a g M 

(4) Vx p (fY) = f(p) .V^r + X p {f) . Y p 

(5) if X and Y are C°° vector fields，then so is ^ Vx p Y. 

PROOF. Equations (2) ， (3)，and (4) are easy to prove from the definition. Con¬ 
dition (5) follows from Lemma 8, since we can choose a C°° section p u(p) 
in a neighborhood of p. Equation (1) also follows from the Lemma，since we 
have (X p + X f P Y - + X f \. 
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Now let us compare this Koszul connection V with the one obtained from 

the Cartan connection corresponding to an Ehresmann connection (o on F(M), 

• • 

We can write a) = {(Oj), where each ⑴ j in the matrix is an ordinary 1-form on 
F{M). Equivalently, we can write 


(x> — 




where Ej is the matrix with zeros everywhere except for a 1 in the z th row and 
the y th column, so that 

Let (x, U) be a coordinate system, and let s : U F(M) be the “natural 


section 




s{q) 




dx l 


q 


，…， 


q 


Then the Cartan connection corresponding to a) assigns {s*(Oj) to this moving 
frame. So the operation V defined on page 285 is determined on U by 



10. PROPOSITION. These two connections are the same，V — V. 

PROOF. It obviously suffices to show that ▽a/a A ： /3/3x j ^ = d/dx J \ since 

V and V are both Koszul connections. Let / = /g ^ x j . By Proposition 8, it 
suffices to show that 



So we need to show that 
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Now, by definition of / we have 


(*) 


Y^f k (u)u k 


k 


dxJ 


for all m = (wi, ..., u n ) g F(M). 


jr(M) 


In particulai 


f k {s{q)) = Sj. 


Writing equation (*) for u — s(p) - exptN, and differentiating with respect to t, 
we obtain 


E 


k 


d_ 

dt 


f k (s(p)-exptN)^ 


t~o 


dx k 


p k 


d_ 

dt 


(s(p) - exp tN) = 0, 


：o 


and hence 


⑼办 )( 产 ) 


k 


dx k 


-J2S^(s(p)^N) k = -(s(p)^N)j, 


p 


k 


In particular, 


Emw 户 ) 


k 


dx k 




a 


p 


dx^ 


p 


< s j E 


/3 


dx^ 




p 


dx^ 1 


p 


So 


^K) s{P) if k ) 


[ 一 ^ k = fi 

0 k ^ fi. 


Now the lift (d/dx^* of d/dx 1 at points s(q) is given by 


* 


a 汐 


+( 占 ))= 、 ( 士卜 卜 (‘)) <)• 


So 


鼠 ( 产 ) 


3(f k o s) 




dx l 


d(Sf) 


(f)- 




p 



o{ElU p) {f k ) 




L( 
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There is no need to repeat here the definition of Vx A for arbitrary tensor 
fields A; this can be defined in either of the two ways used in Chapter 6, and 
we will use any results from that chapter which we require. 

We return for a moment to a connection ⑴ on a general principal bundle 
n : P ^ M with group G. Consider a A:-form a on P, with values in a vector 
space V. We define a K-valued (A: + l)-form Da, the covariant differential of a y 
by 

Da(Y \ y ..., >a ： +i) = {da){hY \ y ... 

where d is the ordinary differential and hY is the horizontal component of Y. 
In particular, we define the curvature form of a; by = D(o. Thus D is a 
g-valued 2-form on P. 

11. PROPOSITION. For all a € G，we have R a *n = Ad (a 一 1 )^2. In other 
words, for all tangent vectors Y\ y Y^ € P u we have 

， r 2 ) = Ad(a 一 1 ) n ⑺， r 2 ) 

[this makes sense since Y 2 ) € g]. 

PROOF. We have 

R a *^(Y u Y 2 )^ R a *(d(o)(hY u hY 2 ) 

= d(R a *(o)(hY u hY 2 ) 

=d(Ad(a- l )a))(hY u hY 2 ) 

^ Ad(a- l )[da)(hY u hY 2 )] since AdCa -1 ): g — 只 

is linear 

=Ad(a - ” ⑽， r 2 )♦ 

We will eventually see that this Q does indeed correspond to the Q in Chap¬ 
ter 7, but we first introduce the analogue of 0. This analogue cannot be defined 
for connections in all principal bundles, but only for connections a> in the bun¬ 
dle of frames n : F{M) M. On this bundle we have a certain -valued 
1-form 0, defined by 

Ou(Y u ) = u~\it^Y u ). 

We will call this 1 -form the canonical form or the dual form of the principal 
bundle F{M). To see the appropriateness of the latter term, consider a section 
s: U — F{M) given by .s* = {X \,..., X n ). For any tangent vector Y p € M p we 
have 


s*9(Y p ) = ds^s^Yp) = s(p)~ l (Y p ) y 
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so for the z th component 9 l of 0 we have 

s*d l {Yp) — i th component of ^(^) _1 (r p ) 

= i th component of Y p with respect to the basis X\{p) y • •. ， X n {p). 

_ • 

Thus, the s*d l are just the dual forms for the moving frame (X \,..., X n ). 

We now define the torsion form © of a connection a) on F{M) by 0 = D9 
(this depends on a>, since /i, and hence D, depends on a>). 

12. PROPOSITION. For all A e GL(w,R) we have 

R A *e = . 0 , = 乂 ― 1 • ©• 

In other words, for all tangent vectors Y\,Y 2 G P u we have 

R A *d(Y l ) = A- 1 diY,) 

R a *@(Y u Y 2 ) = A - 1 @(Y u Y 2 ) 

[these equations make sense since 0{Y\)^ 0(Fi, Y{) E R”]. 

PROOF. For 9 we have 

Ra* 0(Y\) = 9 u . a (RA*y\) = (w - A)~ l (7t^Y\) 

=A~ l - 
=A~ x -6{Y X ). 


Then for 0 we have 


R A *&(Y l 9 Y 2 ) = R A *(dd)(hY u hY 2 ) 

=d(R A *d)(hY u hY 2 ) 

=d{A~ x - 9 )(hY u hY 2 ) 
= A~ l dd(hY u hY 2 ) 

= A^ d(Y u Y 2 ). ❖ 


A connection co in the bundle of frames F(M) also allows us to define certain 
special vector fields in F{M). For ^ g we define the basic vector field 方⑻ 
corresponding to $ by letting 方 ( 专 ) w be the unique horizontal vector at u such 
that tt* ( 方 ( 专 ） 《) = m(^). In particular, B{ei) u is the unique horizontal vector 
at u which covers m/. 
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13. PROPOSITION. For all ^ g M 71 we have 

( 1 ) = H 

(2) Ra*B(^) = B{A~ X • for all A e GL(w ? M). 

Moreover，if 专 # 0， then 5( 专 ） is nowhere 0. Consequently，if 专 is a 
basis for R 71 ，then B(^\) u , ..., B(^ n ) u is a basis for H u . 

PROOF. (1) and (2) are left to the reader. For the third assertion，note that if 
B{^)u = 0, then 

0 = 7T + (5(^) w ) = m(^), 

so $ = 0. The final assertion follows immediately♦ 

To prove the structural equations in our new setup, we need two lemmas ； the 
first involves fundamental vector fields and basic vector fields，but the second ， 
which holds for connections in any bundle, involves fundamental vector fields 
and arbitrary horizontal vector fields. 


14. LEMMA. Consider the basic vector fields determined by a connection on 
the bundle of frames F(M). For every N e M) and ^ g M 71 we have 


PROOF. Since 4>t(u) = u - exptN = ^exprAr(w) is the 1-parameter group of 
difFeomorphisms generated by we have 


\P(N\ B{^)} - lim 1 - [£ ⑹ - R cxptN ,B(^)] 

-lim- [5 ⑹- 5([exp ~tN] - ^)] 
t 

—B (lim — (exp —tN) - ^) ) 
V^o t ) 

= B(N • ^). ❖ 


by Proposition 13 

since ^ B{^) is 

linear onto H u 


15. LEMMA. Consider a connection on any principal bundle P over M with 
group G. For any X e g, and any horizontal vector field Y on P，the vector 
field Y] is also horizontal. 



PROOF. We have 
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[o{X\ r] = lim - [r - R exptX *(Y)]. 

t^O t L r J 

Since Y is horizontal, so is each ^exp/X*(^). ♦♦♦ 

16. THEOREM. Let cd be a connection on a principal bundle P over M. 
If P is the bundle of frames, with the dual form 0, and the torsion form 0 
determined by co, then we have the first structural equation: 

dd(Y x ,Y 2 ) = - {(o(Y x ) • 6(Y 2 ) - co(Y 2 ) - 0(F,)} + Q(Y u Y 2 ) for all Y U Y 2 6 P u 

[where co(Y\) - 0(^) is the action of the matrix co(Y\) on 0(^) G M n ]. 

If P is any principal bundle, and Q is the curvature form of co, then we have 
the second structural equation: 

dco(Y u Y 2 ) = - [co{Y x ),co{Y 2 )} + Q{Y u Y 2 ) for 3\\Y u Y 2 eP u . 

PROOF. Since each F/ is the sum of a vertical and a horizontal vector, and 
since both sides of the first structural equation are skew-symmetric and bilinear, 
we can prove this equation by considering 3 cases. 

Case 1. Y\ and Y 2 are horizontal. Then (o{Yi) = 0, so the equation reduces to the 
definition of © as D9. 

Case 2. Y\ and Y 2 are vertical. Then the right side is 0. If we extend Y\ and Y 2 
to vertical vector fields Y\ and F 2 , then the left side is the value at u of 

Yx{ 9{Y 2 ))- Y 2 {Q{Y x ))- 6 ([Y u Y 2 }\ 

which is 0, since [Fj, Y 2 ] is also vertical. 

Case 3. Y\ is vertical and Y 2 is horizontal. Let Y\ = o{N) u for N G gl(w, M) and 
let Y 2 = B{^) u for | 6 M n . Then 

- {^(r,)- 9{y 2 ) - co{y 2 ) - 0(r,)} + ©(r,,r 2 ) =-^v-| + o + o. 
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while we have 

d9(Y u Y 2 )^ a(NmB(m(u) - B(^ma(N)))(u) - 9([a(N), B ⑻])⑻ 
= 0 — 0 — Q(B(N . 专 ))(w) by Lemma 14 
=—N - 


This proves the first structural equation. 

The second structural equation will be proved similarly. 


Case 1. Y\ and are horizontal. The proof is as before. 

Case 2. Y\ and are vertical. Let F/ = a{Xi) u for A7 € g. Then Q(Y\, Y 2 ) — 0, 

while 


da)(Y u Y 2 ) = a(Xi)(a)((j(X2)))(u) 

- a(X 2 )((o(a(X l )))(u)- co([a(X,)^(X 2 )])^) 

= 0 — 0 — a)(a([X\, X 2 ]))(u) by Proposition 2 

= -[u 卜- [⑴⑹，⑴ am 

Case 3. Y\ is vertical and Y 2 is horizontal. Then the right side is 0. If we extend Y 2 
to a horizontal vector field Y 2 and let Y\ = a(X) u for A" € 3 , then the left side 
is the value at u of 

g{X){(d{Y 2 )) - Y 2 {(d{g{X))) - a)([a(X), Y 2 ]) = 0 ? by Lemma 15♦ 


The structural equations for a connection co on the bundle of frames F{M) 
can also be written in terms of ordinary forms. With respect to the standard 
basis q ， • • •，〜 of we can write the -valued forms 9 and © as 




© = [ © z - ei, 

争 


for certain ordinary forms 9 l and &, Similarly, with respect to the basis Ej of 
introduced previously, we can write 

<jl> — 〉: d)j - Ej Q = 〉: - Ej. 

攀 ♦ * ♦ 

hj hj 
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It is easy to see that the structural equations can then be written 

d9 ( = — coj a 6 J + 0 1 
J 

d(i/j 二 - > : (O^ A (O^j -|- . 

k 

Instead of introducing these forms explicitly，it will be convenient to write the 
structural equations in the abbreviated form 

dO = —co a 0 + 0 
dco = —a ) 八 co + 

For a section s : U F(M) we obtain 

dis*^)= - 八 s *° j + ， 01 

J 

d{s*(jJj) = - 八 + 作 ) •. 

k 

• * 
Since s*6 l are just the dual forms to the moving frame s, this shows that s*& 

and are precisely the torsion and curvature forms* for the moving frame s 
which are determined by the Cart an connection which assigns s*a) to s. Propo¬ 
sitions 11 and 12 correspond to Proposition 15 of Chapter 7. Recall that the 
formulas of this proposition allowed us to define the tensors T and R. Essen¬ 
tially equivalent, but much neater, definitions will now be given directly from 
Propositions 11 and 12. 

Let ⑴ be a connection on F(M) with torsion form 0 and curvature form S2. 
For X\,X 2 e M p , we let 

T(X u X 2 ) = u(@(X u X 2 )) e M p , 

where X{ e F{M) U are any vectors with This definition is inde- 

*In Ehresmann’s original treatment, the torsion and curvature forms @ and Q on 
F(M) were defined by the structural equations. The definition in terms of D was 
given by Ambrose and Singer, A Theorem on Holonomj, Trans. Amer. Math. Soc. 75 
(1953) ， 428-443. This paper also introduced much of the convenient terminology, like 
“horizontal vectors”. 
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pendent of the choice of u, and of Xi e F(M) U . To see this, first note that 
if X\ is replaced by X\ with 7t^(X\) = X \, then X\ — X\ is vertical, so 

S(X 1 -^ U X 2 ) = De(x l -^ u Xi) = o ； 

similarly ， X 2 may be replaced by any X 2 with 丌 * 以 2 ) = X 2 , without changing 
the value of u((3(X \, X 2 )). Then note that if we change m to m • A, we can pick 
Ra^i for the new A 7 , and we have 

(w • A)(S(R a ^X u Ra^X 2 )) = (u • A)(A~ X • 0(X,, X 2 )) by Proposition 12 

= u(S(X u X 2 )). 

Since 0 is a form, it is clear that T is skew-symmetric. 

Similarly, for Xj, A 2 , ^3 e M p we let 

R(X u X 2 )X,= u(Q(X u X 2 )^ 

here Q(X U X 2 ) e is an w x w matrix, so it acts on u~ x g M. n . Just as 

or X 2 . 

If we change u to u - A, then by Proposition 11 , 

( W • A)(n(R A a u RaM - ([u . A]- x X^)) 

=(w • A)([A~ l n(X u X 2 )A]^ (A~ l . u~\X^))) 

= u(Q(X u X 2 )(u~ l X 3 )). 

Since S2 is a 2 -form, it is clear that R is skew-symmetric in X\ and X 2 . 

In Chapter 7, we mentioned that the structural equations could be used to 

prove that the torsion and curvature tensors T and R, defined for a Cartan 

* * 

connection in terms of & and ， were just those derived from the V which 
could also be defined for the Cartan connection. Here we will actually carry 
out this proof for an Ehresmann connection. 

17. PROPOSITION. For any vector fields X, Y,Z on M we have 

T{X, Y) = V X Y-V Y X ~[X y Y] 

R(X y Y)Z = SJ X SJ Y Z- SJ Y SJ X Z- V {XiY }Z. 


before, we see that the definition does not depend on the choice of X\ 
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PROOF. Recall that we defined fy : F{M) by fy(u) 

If Y* is the lift of Y, we can write fy(u) == d(Y* u ). So Lemma 8 gives 

V x Y{p) = u(X* u (9(Y*))) for u e 丌 


Consequently, 

T(X p ,Y p ) = u(&(X\,Y\)) 

= u{D9{X\,Y\)) 

= u(de(X\,Y\)) 

=u{X* u (e(Y*))- Y\(e(X*))- 6{[X\ Y*](u))) 

= Y] p , 

since n^([X*,Y*]) = [X, Y\ 

To prove the second equality, we note that since X* and Y* are horizontal, 
we have 


Q(X\,Y\) = dco(X*,Y*)(u) 

= X*(co(Y*))(u) - Y*(co(X*))(u) - co([X\ Y*])(u) 
= ^co([X\Y*])(u). 

If we set the vertical component of [X*, Y*] equal to 

v[x\ r*](w) = g(N) u ror n e 

then we obtain 

n(X\,Y\) = -N, 

so 

(1) R{X P , Y P )Z P = u(Q(X\, r* M ) • (u~ l Z p )) 

=u{-N - fz{u)). 


On the other hand, since we also have fzM = 9(Z* U ), we obtain 



332 


Chapter 8 


(2) Vx^yZ(p) - VrVxZ(/7) - V[x,y]Z(p) 

=u(X\{Y*fz)~ Y\(X*f z ) - (h[X\ r*] M )/z) 

= u{{v[X\Y%)f z ). 

The expressions in (1) and (2) are equal, since 

o{N) u f z = lim - [/z(w • exptN) - fz(u)] 

t-^0 t 

=lim - [(exp W) - 1 . / z ⑻一 /z ⑻] 
o t 

=-N • /z(w). ❖ 

One of the steps in this proof is of sufficient importance to be stated explicitly, 
along with a counterpart, in the following corollary of the structural equations. 

18. PROPOSITION. If ⑴ is a connection on the bundle of frames F{M), and 
B\, B 2 are basic vector fields, then the horizontal component of [B\ , B^iu) is 
the value of B 2 U )) at u. 

If 出 is a connection on any principal bundle P over M, and Y\, Y 2 are 
horizontal vector fields, then the vertical component of [Ki, Y 2 ](u) is the value 
of cr(—(y\ w ， hw)) at w. 

PROOF. Since B\ and B 2 are horizontal, the first structural equation gives 

C m )(5i w ， B 2u ) = d9(B\, B2 ){u) 

=B x {9{B 2 )){u)- B 2 {e{B x )){u) ~ 9([B U B 2 ]){u) 

=—9(h[B\, ^2]( w )) since 0 = 0 on vertical vectors. 

Since we can always write h[B\ , B 2 ]{u) as B(^) u for some ^ G R n , the first result 
follows from Proposition 13(1). 

Since Y\ and Y 2 are horizontal, the second structural equation gives 

Yiu) = da){Y\, Y2 ){u) = —co([Y\, >2](w)), 
which gives the second result. 
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Suddenly, we are ready for 


19. THEOREM (THE TEST CASE ； SEVENTH VERSION). Let (M y { ? )) 
be an «-dimensional Riemannian manifold for which the curvature tensor R 
(for the Levi-Civita connection) is 0. Then M is locally isometric to M” with its 
usual Riemannian metric. 

PROOF. On the bundle of frames F(M) we have a connection co with 0 = 0 
and 二 0, for which parallel translation preserves the inner product (,). 

Step 1. The second part of Proposition 18 shows that the bracket of two hori¬ 
zontal vector fields is again horizontal. Thus, the distribution H is integrable. At 
a point p e M choose an orthonormal frame u G n~ l (p) y and let N be the 
(«-dimensional) integral manifold of H through u. Clearly, N is locally the 
image of a section s' U — F(M) with s(p) = u. 



Step 2. Consider the basic vector fields 5(e/), for e\,... ,e n the standard ba¬ 
sis of Since they are horizontal, the bracket [B(ei) y B(ej)] is horizontal. 
But the first part of Proposition 18 shows that the horizontal component is 0, 
so [B(ei), B(ej)] = 0. If we let Xi{q) = 兀 *(5(e/)( l y(q)))，then we also have 
[X h Xj]=0, since B(ei) and Xi are 丌 -related. But, by definition of B(^), we 
have 

7T*(5(Q)Cy^))) = s(q)(ei) 

=/ th vector of the frame s(g). 

Thus, ^ = (X\ y • • • ， X n ) where [Xi, Xj] = 0; hence there is a coordinate system 
. ,x n with Xi = d/dx f . 
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Step 3. We claim this is the desired coordinate system. We just have to show 
that s(q) is always orthonormal, so it suffices to show that s(q) is the parallel 
translate of w = s(p) along any curve c in f7 from p to q. This is obvious: 
to translate u along c, we choose a lift c* of c with c*(0) = m, and then the 
translate of u is c*(l); but clearly c* = s o c. 


To wrap things up, we present the new version of the Bianchi identities. 


20. THEOREM. For a connection co on the bundle of frames F(M), with 
dual form 0, torsion form 0, and curvature form Q, we have 

(1) (Bianchi’s first identity) D& = Q A 6. In other words，for X,Y,Z e 
F{M) U we have 

Z) = [Q(X, F). 6{Z) 0(Z) 

* * + Z) • 6{X) - Q{Z, Y) • 6{X) 

+ D(Z ， 义 ） • 6{Y) - Q{X,Z) • 6{Y)] 

= r). 6{Z) + D(r ， z). 6{X) + d(z, x) • e(Y). 


For a connection co on any principal bundle, with curvature form we have 
(2) (Bianchi’s second identity) DQ = 0. 


PROOF. Applying d to the first structural equation, d6 — —co 八 0 + 0, we 
obtain 


So 


0 = _ {d(x) A 0) -|- i^Ct) A dO) -|- d 0. 


DG{X,Y,Z) = d® 、 hX ， hY ， hZ) 

=(dco a 9)(hX ， hY ， hZ) — 0 
=(^2 a 6){X, Y, Z), 

since dco(hAjjB) = B) and 6(hA) = 0(A) for a.W A, B e F(M) U . 

Applying d to the second structural equation, dco — —co 八⑴ + we obtain 


0 = —(dco aco) (co a da)) + dQ. 
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So 

DQ(X,Y,Z) = dQ (hX, hY ， hZ) 

= (dco A co 、 (hX ， hY, hZ) — (^ a dco){hX, hY, hZ) 

= 0， 

since co(hA) = 0 for all A € F(M) U [but recall that {dco Aa>) ~ {a> a dco) is not 
itself 0, since matrix multiplication is not commutative]. 
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SUMMARY 

The diagram on the opposite page summarizes the relationship between the 
various definitions of a connection: 

(1) « 3 functions rg assigned to each coordinate system (classical), 

(2) V operator on vector fields (Koszul), 

(3) nxn matrix of 1-forms {(x) l j) assigned to each moving frame (E. Cartan )， 

(4：) n x n matrix-valued 1-form co on F(M) (Ehresmann), 

(5) distribution H on F[M). 

The relationship between (1) and (2) is immediate, as is the relationship be¬ 
tween (4) and (5). In Chapter 7 we saw how to pass between (1)-(2) and (3 )， 
and in this chapter we saw how to pass between (4)-(5) and (3). We can also go 
directly from (5) to (1)，since V is defined by parallel translation. It remains to 
indicate how one passes directly from (1)-(2) to (5). 


21. PROPOSITION. Let V be a Koszul connection on M. For u e F(M), 
let 

H u = {s^Xp : p = n(u) and Vx p s — 0} 

[where ^x p s denotes (Vx p X \,..., ^x p ^n) if ^ = (% ， ••• ， X n )\, Then H u is a 
subspace of P u , and the distribution w // M defines a connection on F{M) 
for which the covariant derivative is the given V. 

PROOF. Suppose that s a is another section with a(p) = I and Vj ^{s -a) = 0. 
Then 

(1) 0 = V Xp (s •a) = s ^ ⑷ + (V Xp s、• a 

=s - A ⑷， 

which implies that X p (a) is the zero matrix. So by Proposition 3 we have 

(s - a)^(X p ) = s^X p + o{X p {a)){s{p)) 

— ^ ^p • 

Thus H u contains exactly one vector X* u with 7t^(X* u ) = X p . Now given 
vectors A Uy B u e H u , let it^A u = X p and n^B u = Y p . We can choose s with 
s{p) = u and = Vy〆= 0; then we also have Vx p ^y p s = 0, and we must 
have 


~ Yp = ~h Yp) G Hu ， 
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Thus H u is a subspace.* It has the same dimension as M，and contains no 
vertical vectors except 0, so we have F(M) U — ㊉ // w . It is clear that Ra*H u = 
H u .a for all A e GL(n, M)，since s - A also satisfies .^4=0. 

To prove that // is a C°° distribution, we consider a C°° vector field X on M ， 
and a C°° section s. We claim that there is a function a: U —> GL(«, M)，defined 
in some open set f/，such that Vx p - a) =0 for all p e U. This is because 
equation (1) shows that this condition is equivalent to 

0 = s • X p {a) + (Vj〆）_ «， 


and this is a differential equation for a. For each A e GL(« ， M) and p e U, let 
ci p ,a be the solution with a Pt A{p) = s(p) - A. For every u e F{M) we have 
u = s(n{u))^ A{u) for some A{u) e GL(n, M) which depends differentiably on u. 
Then the vector field 

is C°°, since a Pi A is C°° in p and A. 

Thus the distribution // is a connection on F(M). For a curve c in M ， 
consider a curve c* in F{M) with jt o c* = c and all c^ f {t) horizontal. By 
definition of the connection, this means that for some X p e M p = M c (/) we 
have 

c*\t) = s^{Xp) where ▽心 s = 0. 

Of course, X p must be jt^s^X p = jt + c* / (0 = c f (t). So V c w = 0. From 
this it is easy to see that the components of c* are vector fields along c which 
are parallel along c, with respect to the original V. Hence parallel translation 
defined with respect to the connection H is the same as parallel translation 
for V. This implies that the covariant derivatives are also the same. ♦ 

Proposition 21 gives the most geometric way of going from V to //, and 
hence to co. We can also give a computational description of co, in terms of a 
coordinate system (x, U) on M. Recall that = (x ( o jr ， xj) is a coordinate 
system on jt _ 1 ([/), where we write a frame u as 

n g 

“厂1>)⑻. [7 . 

• » 

At each w, the non-singular matrix (xj (w)) has an inverse matrix (>y(w))= 
(xj(w))- 1 . 


* We can also define H u simply as s^{M p ) y where ^ is a section with s(p) = u and 
{Vs){p) = 0. 
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22. PROPOSITION. Let rg be the w 3 functions assigned to the coordinate 
system (x, U) by a classical connection on M. For the corresponding Ehres- 
mann connection co = ((o l j) = J^coj - Ej we have 


i,j 


0) 


o 7T)xJ d(x^ O 7T)^ on n~ l (U). 


PROOF. Let 5* = (3/3X 1 ， … ， d/dx n ) be the natural section. We know that the 
matrix of 1-forms (a) s l j ) for the corresponding Cartan connection is given by 


r lj = K) 


dx^ ) 9 


and consequently the corresponding Ehresmann connection satisfies 


( 1 ) 





a 


dx^ 


p 



If we write 




dx 从 




p 


3(X / o 7r) 






then 


a 


a = 


dx^ 


(x l o n) 


d(x l o n o s) 




p 


dx 从 


p 


dx l 

dx^ 


8 l . 




p 


b\ 


Si, 


3x 从 




3(X; o 5*) 


P. 


dx^ 


dx^ 


p 


0. 


p 


so 


a 




dx^ 


a 


p, 


3(X^ o 7T) 


s(P) 


A similar calculation shows that for any A e GL(n, M) we have 


( 2 ) 


a 


d(x^ O 7r) 






a 


* 


s(p)A 


dx^ 


Ra 


a 


* 


P 


dx^ 


p 
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Now any u G n~ l (U) can be written u : = s{n{u)) - (xj ⑻） = = s{p) - A, say. So 
for the matrix we have 


K) 


a 


3(X^ o 7r) 


j = (—)(^4* 




p 




A 


p 


by (2) 


by definition of an 
Ehresmann connection 


A-\rlj(p))A 


by (1 )， 


so that 


a 


(O 


d(x^ O 7r) 


= ^ A ~ l yA^ A j 

u/ 入 ， V 

= 少 i ⑻厂^兀⑻以/⑻- 
入 ， V 

This accounts for the coefficient of d(x^ o n) in the desired expression for (Oj. 
Now let us write 




u 


(clearly each d/dxj is vertical，since 7T + (3/3xy )(/) = 3(/ on)/dXj = 0, as f on 
is constant on fibres; hence the d/dxj 1 span V u ). Then 


a 


X 


^(Ep(xf)(u) 
%*( 五》)(义夕） 


xj(u - exphE^) — xHu) 


入 


h 


lim 

h—^0 

\j(u - Ep = : A th component with respect to 5*(7r(w)) of (w • E^) 

=，• •’ ” ，• ••$>〆 观 

y 

_ ■ ■ ” •* ■ ■ ■ ■ ” • • “ ^(X 


4 5 r 


So 


o(Ep(u) = D 


入 W 


U 



Hence 
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= (Ep i J =a> l j (a(E^)(u)) 



(亂) . 


This is easily seen to account for the other term ，Ylx dxj ， in the expression 
for 吟 ♦ 
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ADDENDUM 1 

THE TANGENT BUNDLE 
OF THE BUNDLE OF FRAMES 

The existence of a connection on the bundle of frames F(M) has some in¬ 
teresting implications. 

23. PROPOSITION. If there is a connection on F(M), then the tangent bun¬ 
dle of F{M) is trivial. 

PROOF. The n 2 + n vector fields o{E l j) and B{ei) are everywhere linearly 
independent. 

24. COROLLARY. If there is a connection on F(M), then M is paracompact. 

PROOF. Since the tangent bundle of F(M) is trivial, there is clearly a Rie- 
mannian metric on F{M). So F(M) is metrizable (Theorem 1.9-7)，so every 
component of F{M) is a-compact (Theorem I. A-l). Since n : F{M) M is 
a continuous map onto M, it follows that each component of M is a-compact, 
so M is metrizable (Theorem I. A-l). ♦ 

Of course, a non-paracompact manifold M may still have a connection in 
some principal bundle n : P ^ M with group G, For example, the trivial bundle 
M x G ^ M has an obvious connection (the horizontal vectors Y are those 
with 7T2*F = 0, where ni : M xG ^ G is projection on the second coordinate). 

If M is paracompact，then there is a connection in any principal bundle 
7T : M ; this can be proved using partitions of unity, noting that any convex 

combination of Ehresmann connections is also a connection (a convex combi¬ 
nation of connections a>i makes sense, since the values of ⑼ are in the vector 
space g). 

For the special case of a Riemannian manifold (M, { ， 〉)，we have a specific 
connection in F(M), the Le'.i-Civita connection. It is easy to see that the Levi- 
Civita connection for a Riemannian metric can also be defined for an indefinite 
Riemannian metric (a non-degenerate inner product in each tangent space); it 
is the unique symmetric connection for which parallel translation is an isometry, 
and the are gi\*en by exactly the same formula as the Christoffel symbols. 
So Corollary 24 implies 
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25. COROLLARY. If M has an indefinite Riemannian metric, then M is 
paracompact. 

This result cannot be proved by modifying the proof of Theorem 1.9-7, for 
there may be paths of negative length between two points. For another proof 
of the result，see Sachs and Wu ，General Relativity for Mathematicians, section 8.2.1. 
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ADDENDUM 2 

COMPLETE CONNECTIONS 

If is a connection on F(M), we define a geodesic, as usual，to be a curve c 
such that dc/dt is parallel along c. The connection co is complete if every 
geodesic segment can be extended to M. Unlike the Levi-Civita connection for 
a Riemannian metric, a general connection on F(M) may not be complete 
even if M is compact. To construct an example, we begin with the Levi-Civita 
connection for the metric 〈 ， ) = e x dx ^ dx on M. A geodesic c has constant 
squared length 

〈 〆(,) ， 〆(,)》“” = e c{t) c f (t) 2 , 
so e c ^^ 2 c f (t) must be constant, and thus 

c(t) = 2 log(a + bt). 

Thus，（ ，〉 is not complete (the geodesics run off to infinity in a finite amount 
of time). If we identify the bundle of frames ^(M) with M x (M — { 0 })，then the 
integral manifolds of the horizontal distribution H of our connection are the 
sets 

{(ciO.c'it))} = |(2 log( 6 t + &) ， 邊 7 )} 

= {(x,2be~ x/2 )}. 

Now we will identify x with x + 1 for all x G M; the resulting manifold 
is a circle 5 1 and we have a map 7T: M ^ S 1 given by 7r(x) = equivalence 
class of x. The distribution H on ^(M) gives rise to an obvious distribution 
on F(S l ), which determines a connection on F(S l ). The geodesics for this 
connection are the curves 

7t o c(t) = ix(2 log(a -h bt)). 

They cannot be extended to all of M, since they go around S 1 infinitely often 
in a finite amount of time. 

There are other anomalies for general connections. For example, even though 
a connection co is complete, it may not be possible to join every pair of points 
with a geodesic. If we consider a Lie group G，it is easy to see that there is a 
unique connection a) on F(G) which makes all left invariant vector fields paral¬ 
lel. Then geodesics through e are just 1-parameter subgroups. The connection 
is complete, since 1 -parameter subgroups can be extended to all of M, but (Prob¬ 
lem 1.10-27) it is not necessarily true that every element lies on a 1 -parameter 
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subgroup. It is not known whether every two points can be joined by a geodesic 
in a compact manifold with a complete connection. 

Finally, it should be pointed out that results about geodesics for the Levi- 
Civita connection of a Riemannian metric need not hold for the Levi-Civita 
connection of an indefinite Riemannian metric. For example, there is an indef¬ 
inite metric whose Levi-Civita connection is complete but for which not every 
pair of points can be joined by a geodesic (see J. W. Smith, Lorentz structures on 
the plane, Trans. Amer. Math. Soc. 95 (I960) ， 226—237). 
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ADDENDUM 3 

CONNECTIONS IN VECTOR BUNDLES 

Suppose 7T: £ ^ M is a C°° vector bundle over M. We have already seen 
how to form a principal bundle m : F(E) — M，in which* every u e m~ x {p) 
is a frame for n~ l (p). We can then define a connection in the original vector 
bundle 五 to be a connection in the principal bundle F(E). However，there is 
a more direct way of defining a connection in £*，which makes certain things 
work out more simply, since the bundle of frames contains a lot of superfluous 
stuff in it. 

For a vector bundle n: E — M (as for a principal bundle) we define the 
vertical subspace V e C E e at any point e e E to be the image (p) e ), 

where p = 7t(e); vectors in V e are called vertical, and V e is clearly the kernel 
of 丌 * : E e — Mp. For every frame u e and every 专 e we also have 

w(§) G n~ x {p) defined by w(§) = • w’. Finally, for every non-zero o: g M, 

we let a: ^ be the diffeomorphism defined by a(e) = a ^ e. We define a 

connection in £* to be a distribution H such that 

(1) ㊉ K for 2i\\ e e E 

(2) a^(H e ) — Ha( e ) ~ e for all non-zero a e E 

(3) // is a C°° distribution. 

The subspace H e is called the horizontal subspace at e, and vectors in H e are 
called horizontal. Applying (2) with any a: #1， and using a local trivialization ， 
we see that if 5*: M ^ is the zero section, then H s 、 p ) must be s^{M p ). 



We will first show how such a connection arises from a connection a) on the 
principal bundle F(M). Let H be the distribution on F(M) determined by a). 

* This bundle is a special case of the “associated principal bundle”，which is used in the 
theory of fibre bundles. 
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Given e e 7T -1 (p), choose any frame u e it~ x (p); then there is a unique ^ eM. n 
with u(^) = e. Now we can define a map#: F(E) Eby^(v) = u(^)forall 
v e F(E); this map takes fibres of F(E) to fibres of E. We let H e = (j>^{H u ). 
This is well-defined，for if we choose u - A instead of u, then we must choose 
A~ l - § instead of since 

<P a - 1 ^(v) = v(A - 1 ^) = v^A - 1 (^) 7 

we have . 专 = 办 ◦ , so 

(^A~ J -$)*(^u-a) = Ra-' * 、 H U .A 、 

= 0 专 *//“， by Proposition 4. 

We leave it to the reader to verify that these well-defined H e do satisfy conditions 

(1)，(2)，(3). I # 

We also want to show that every connection H in E does arise in this way from 
some connection H in F(E). To do this, we first consider parallel translation. 
If c: [0,1] —► M is a curve, then the parallel translation 

Tr ： 7r _1 (c(0)) ^ n~\c{p)) 

determined by H is defined in the obvious way: r t (e) = c^(t) where : [0,1] ^ 
E is the unique curve with c 卞 ⑼ = =e such that each c^(t) is horizontal; the 
existence of in E is proved similarly to the existence of c* in a principal 
bundle. Clearly x t is a diffeomorphism，whose inverse is the parallel translation 
along the reversed part of c from c(t) to c(0). From condition (2) for H it 
follows that z t (a • e) = a - r t (e) for a ^ 0. This is true even for or = 0, i.e. ? r t 
takes the zero vector at c(0) to the zero vector at c(t); this follows from the fact 
that H s ^ p ) = s^{M p ) when s is the zero section. It now follows that r t is a vector 
space isomorphism, because of the following 


CLEVER OBSERVATION ： If/: ^ IR m is differentiable at 0 and satisfies 

f (a • v) = af (v) for all u g and or e M，then / is linear. 

PROOF. Let T = Df(0) : ^ M m . Then 




= lim/(,) = /(,).❖ 
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We now define the connection H in F(E). Let u € For every C°° 

curve c: [0,1] — M with c(0) = p, we define the curve c*: [0,1] ^ F(E) by 

the subscript i denoting the ; th component of the frame. Then we define H u to 
be the set of all vectors c*(0) for all such curves c. The reader may verify that H 
is an Ehresmann connection, and that it gives rise to the connection H on E (the 
fact that x t is a vector space isomorphism is used to prove that Ra*H u — H u .a). 

Given a connection H on E, we use the direct sum decomposition E e = 
H e ® V e ， to define the horizontal component hX and vertical component vX of 
any tangent vector X e E e . Condition (2) for H is equivalent to the equation 
a^hX = h(a^X), or to a^vX = v(a^X), Remember that the tangent space 
n~ l (p) e at e of the vector space n~ l (p) can be identified with n~ l (p) itself. 
Since vX € (p) e ), we therefore can, and henceforth will, regard vX as 

an element of (p). 

The equation a^vX = v{a^X) then becomes 
(*) v(a^X) = a - vX. 

Now consider a section s : M ^ E oi the bundle n: E ^ M, and a vector 
X p € M p . We define 

^X p s = vs^{Xp) € n~\p). 

It is clear that is linear in X p and in s. For a C°° function /: M ^ E, 
the analogue of Proposition 3 is the formula 

(**) (/ • s)^(X p ) = f(p)^X p ) + X p (f) - s(p), 

where s(p) € n~ x (p) is identified with a tangent vector in n~ l (p) at / (p) s(p); 
to prove this formula, we introduce a local trivialization, and observe that it 
becomes the product rule for the derivative. If we take the vertical component 
of both sides of (**) and use (*), we find that 

▽A>(/ •扣 Ap)^x p s + x p (f) . s(pl 

Thus V is a Koszul connection. If the connection H in E comes from the 
connection H in F(E), then this V is the same as that determined by H: this is 
an easy consequence of Lemma 8 (although Lemma 8 is concerned only with 
F(M) = F(TM). it is easy to see that it generalizes to any F(E)). 

Finally, we point out that if we are given V, then H e is simply : p = 

n(e) and = 0}: it can also be defined as s^(M p ), where 5* is a section such 
that Vs(p) = 0. 
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ADDENDUM 4 
FLAT CONNECTIONS 

Consider the trivial principal bundle n: M x G ^ and let 712 ： M x 
G ^ G be projection on the second factor. We can define the canonical flat 
connection H on M x G by letting = ker7r2 + : (M x G)( p ^ a ) G a . 

It is easy to see that the corresponding cj-valued 1-form co on M x G is given 
by (o = where is the natural c^-valued 1-form on G (defined on 

pg. 1.403). Using the equations of structure of G, in the form given on pg. 1.404, 
we now have, for all Fi ， F 2 e (M x G) u 

d(o(Y u Y 2 ) = dn 2 ^a) f )(Y u Y 2 ) 

= n 2 *(d(o / )(Y u Y 2 ) 

— da) f (7T2^Yi,7T2^Y 2 ) 

= -[⑴’(丌 2*0’ (丌 2* G )] 

= -MY x )^(Y 2 )l 

Comparing with the structural equations of the principal bundle M x G (The¬ 
orem 16)，we see that for this connection co we have ^ = 0. 

Conversely, suppose n : P — M is a principal bundle with group G, and a 
connection co such that Q = 0. By Proposition 18， the distribution H corre¬ 
sponding to co is integrable. From this it is easy to see that around any point 
p e M there is a neighborhood U and a diffeomorphism t : n~ l (U) U x G 
of the form t(u) = (7T(w )， 0(w)), where (f>(u - a) = • a ， such that U{H u ) 

is the horizontal subspace at t(u) for the canonical flat connection in U x G. 
(One part of our final proof of the Test Case essentially used this fact.) 

It should be noted that the second structural equation of Euclidean space 
(Proposition 7-1), with which we began our whole investigation of moving 
frames, could have been deduced from the equations of structure of GL(«, M), 
by a process essentially equivalent to the deduction just given for the equation 
d(o(Y\, Y 2 ) = —[ ⑴ (Fi), (o(Y 2 )]. To obtain the first structural equation, we would 
have had to consider the equations of structure for the group of affine motions 
[an “affine motion” is a translation followed by an element of GL(«, M)]. In 
general, for a connection co on F(M), the torsion form and the first structural 
equation for (o can be interpreted in terms of a connection in the bundle A{M) 
of affine frames of A/, where an “affine frame” of M p is a pair (u, wi，.. • ， u n ), 
for v e M p and (wi,..., u n ) a frame for M p . For this interpretation, the reader 
is referred to Kobayashi and Nomizu, Foundations of Differential Geometry, Vol. 1, 
pp. 125-130. 
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ond fundamental form 
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Gram-Schmidt orthonormalization, 
43, 260 

Great circles, triangle of, 134 
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Helix, 32 

“left handed”，33 
“right handed”，33 
rotation of, 33 
Hessian, 201 
Horizontal 

component, 317, 348 
curve, 318 
subspace, 315, 346 
vector, 315, 346 
Huygens, G., 9 
Hyperosculating conic, 43 


Immersion, 1 

Indefinite Riemannian metric, Levi- 
Civita connection for, 342 
Index, rotation, 19 
Inequality 
Schwarz, 135 
triangle, 203 

Infinitesimal triangle, 74, 100 
Inside of a curve, 15 
Integrability conditions, 262 
Invariance of curvature under proper 
Euclidean motions, 39 
Invariants, complete set of, 39 
Inward pointing normal, 52 
Isothermal coordinates, 297 
Isotropic, 291 


Jacobi identity, 245 


Kobayashi, S., 349 
Kobayashi and Nomizu, 349 
Koszul, J. L., 227 
Koszul connection, 227 
Kulkarni, R. S., 279 


Left handed helix, 33 
Left, act on, 262 
Leibniz, G. W., 9 
Length of curve, 200 
Levi-Civita, T., 209, 234 
Levi-Civita connection, 238 

for an indefinite Riemannian metric, 
342 

Lie group 

equations of structure of, 349 
review of, 36 ff. 

Lift 

of curve, 318 
of vector field, 317 
Linear group, special, 38 
Lobachevsky, N. I., 163 
Local triviality, 307 
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Manifold 
flat, 179, 184 
Riemannian, 165 
Matrix notation, modified, 261 
Mean curvature, 133 
Metric 

determined by the curvature, 277 ff. 
Finsler, 165, 202 

Levi-Civita connection for indefinite 
Riemannian, 342 
Minkowski, 200 
Meusnier, J. B., 52 
Meusnier 5 s Theorem, 54, 123 
Milnor, J. W., 195 

Minimax definition of eigenvalues, 126 
Minkowski metric, 200 
Modified matrix notation, 261 
Motion 
affine, 349 
Euclidean, 11, 34, 43 
special affine, 39 
Moving frame, 259 
adapted, 270 
natural, 259 

orthonormal, 260, 264, 266 


Natural moving frame, 259 
Natural parameter 

for curves under the group of Eu¬ 
clidean motions, 38 
for curves under the group of special 
affine motions, 40 
Natural (^-valued 1-form, 36 
Newton, I., 9 

Nomizu, K., 349; see also Kobayashi 
and Nomizu 

Non-Euclidean geometry, 163, 301 
Norm 

Banach space, 203 
Normal 

coordinates, 302 ； see also Riemann- 
ian normal coordinates 
map, 112 
plane, 30 


vector 

inward pointing, 52 
unit, 62 


Opposite direction, curve going in, 7 
Orthonormal moving frame, 260, 264, 
266 

Orthonormalization, Gram-Schmidt, 
43, 260 
Osculate, 6 
Osculating 
circle, 6, 27 
parabola, 43 
plane, 25 


Parabola, osculating, 43 
Parallel 

translation of fibre along curve, 319 
translation of vector along closed 
curve, 243 

translation of vector along curve, 
234 

vector field, 243 
vector field along curve, 234 
Parallelepiped, volume of, 56 
Parameter 

natural for curves under group of 
Euclidean motions, 38 
natural for curves under group 
of special affine mo- 
tionslOparameterized surface , 
238 

vector field along, 238 
“Partial derivatives”，272 
Permutation, sign of, 144 
Plane 

curvature of, 115 
normal, 30 
osculating, 25 
rectifying, 30 
Point, 37 
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Polar coordinates 
geodesic, 290 
curvature in, 138 
structural equations in, 272 
Polarization, 185 
Principal axes, 205 
Principal bundle, 306 
action of group of, 306 
associated, 346 
base space of, 306 
connection in, 315 
local triviality of, 307 
over paracompact spaces, connec¬ 
tions in, 342 
projection map of, 306 
total space of, 306 
trivial, 307 
Principal normal, 28 
Projection map of a principal bundle, 
306 

Projections of a curve, 30 
Projectively equivalent connections, 
250 

Proper Euclidean motion, 11 
Puiseux, V A. (Formula of Bertrand 
and Puiseux), 147 
Pythagorean Theorem, 204 


Rectifying plane, 30 
Repere mobile, 259 

fundamental principle of, 270 
Ricci, G, 209 
Ricci Calculus, 209 ff. 

Ricci’s identities, 214, 239 
Ricci’s lemma, 213 
Riemann, G.F.B., 149 ff, 209, 295, 
301, 302 

Riemann curvature tensor, 189 
Riemannian manifold, 165 
locally isometric, 165 
structural equations of, 267 
Riemannian metric, 165 

connection compatible with, 236 
determined b\ the curvature, 277 ff. 


indefinite, 342 

Levi-Civita connection for , 342 
Riemannian normal coordinates, 166 
Right 

act on, 262 
act on M on, 309 
Right handed helix, 33 
Rotation, 11, 34 
index, 19 
of a helix, 33 
through an angle of 6^ 50 


Scalar triple product, 29 
Schur, F., 291 
Schwarz inequality, 135 
Second curvature, 28 
Second fundamental form, 122 
symmetry of, 123 
Second structural equation, 327 
of Euclidean space, 349 
Section, 311 

Sectional curvature, 194 

preserving diffeomorphism, 279 
Self-adjoint, 125 
Serret, J. A., 34 
Serret-Frenet formulas, 34 
Sign of a permutation, 144 
Simple curve, 12 
Singer, I. M., 329 
Smith, D.E., 150 
Smith, J.W, 345 
Space curve, curvature of, 27 
Special affine 

arclength, 39, 46 
curvature, 41 
constant, 41 
curvature functions, 47 
motion, 39 

Special linear group, 38 
Spectral theorem, 125 
Sphere 

curvature of, 115 
unit, of Minkowski metric, 200 
Stokes’ Theorem, 143 
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Structural equation(s)，287 
first，327 

in polar coordinates, 272 
of Euclidean space, 261 
first，349 
second, 349 

of Riemannian manifold, 267 
second, 327 

Subgroups of GL(«,M), a) for, 37 
Subspace 

horizontal, 315, 346 
vertical, 307， 346 
Summary of different definitions of 
connections, 337 
Support line, 13 
Surface 

curvature determines metric, 279 
geodesic on, 135 
parameterized, 238 
Symmetric connection, 222, 238 
Symmetry of second fundamental 
form, 123 

Tangent bundle of F(M) ? 342 
Tangent space of 0(«)，35 
Taylor expansion of a curve, 27, 31 
Taylor polynomial approximations of 
gij> 165 

Tensors，Calculus of, 209 
Test Case, 179 
first version，197 
second version, 217 
third version, 241 
fourth version, 268 
fifth version，274 
sixth version、275 
seventh version、333 
Theorema Egregium, 132， 143 
Torsion，28 
form, 325 
forms，283 
formula for, 29, 30 
tensor ， 221 ， 236 ， 286、 330 
Torus ， flat, 179 
Total curvature, 18 


Total space of a principal bundle, 306 
Triangle 

geodesic, 141 
inequality, 203 
infinitesimal, 74， 100 
of great circles, 134 
Triple product, scalar, 29 
Trivial principal bundle, 307 
Two-dimensional manifolds; curvature 
determines the metric, 279 
Two-parameter variation、256 

Unit 

ball, of Finsler metric, 203 

normal vector, 62 

sphere, of Minkowski metric, 200 

Variation 

two-parameter, 256 
vector field，247 
Vector 

horizontal ， 315, 346 
vertical ， 307， 346 
Vector bundle, connection in, 346 
Vector field 

along a curve, 231 

covariant derivative of, 232 
along a parameterized surface, 238 
basic, 325 
fundamental, 311 
lift of, 317 
Velocity vector, 247 
Vertex of a curve, 23 
Vertical 

component, 317、 348 
subspace ， 307, 346 
vector, 307、 346 
Volume of a parallelepiped, 56 

Weber 、 H. ， 150, 170 
\\ eingarten 
equations、124 
map、122 
\Veyl ， H. ， 170, 251 



